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(57) Abstract 



A method is described for oonstcucting lecambisant double stranded DNA vinises, e^ecially bacoloviiuses and grannlosis virnses, 
by the direct ligation of DNA fragments in vitxo. Also described are direct ligation vims vecton, which are insect vinises that have been 
modified by the inseition of at least one recognition site for a spedfic restriction ecdcHiDcIease which does not cot the viral genome, sa aa 
to fecilitalB die inserti(m of foreign DNA segments by DNA ligaticm in vitro, and to the recombinant vinises foimed by this direct figatioa. 
Further described are inodular e^qnession vectors (plasmids) that are deigned to fiidHtate die assenibly of gene e^qpresaon casseftcs or 
other DNA fragments mto virus insertion modules (as shown m the figure) which are insened subsequently into direci ligation viro vedon 
at predefined sites and in a predefined orientBtioo by figation in vitro, and tt> recombinant viruses derived fiom said ej^presston 
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GENE INSERTION BY DIRECT LIGATION IN VITRO 



Field Of The Invention 

This invention relates to a method for 
constructing reconbinant double stranded DNA insect 
viruses, especially baculoviruses and grsmulosis 
viruses, by the direct ligation of DNA fragments in 
vitro . Thiis invention further relates to direct 
ligation virus vectors, which are insect viruses that 
have been modified by the insertion of at least one 
recognition site for a specific restriction 
endonuclease which does not cut the viral genome. 
This facilitates the insertion of foreign DNA segments 
by DNA ligation in vitro . This invention also relates 
to the recombinant viruses formed by this direct 
ligation. This invention also relates to modular 
e3q>res8ion vectors (plasmids) that are designed to 
facilitate the assembly of gene expression cassettes 
or other DNA fragments into virus insertion modules 
which are subsecpiently inserted into direct ligation 
virus vectors at predefined sites and in a predefined 
orientation by ligation in vitro ^ and to recombinant 
viruses that derive from the use of such modular 
expression vectors • 

Background Of The Invention 

The following abbreviations are used 
throughout this application: 

AcBlNPV - Autoqraoha califomica nuclear polyhedrosis 
virus 

bp - base pairs 
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BEVS - baculovlrus ea^ression vector system 
ECV - extracellular virus 
GV - granulosis virus 
kD - kllodaltons 
5 NPV - nuclear polyhedrosis virus 

occ~ * occlusion negative virus (es) 
occ^ - occlusion positive virus (es) 
OV - occluded virus 
PGR - polymerase chain reaction 
10 pfu - placpie forming unit 

p.i. - post-T infection 

PIB * polyhedron inclusion body (also known as 
occlusion body) 

5' UTR: The mRNA or gene sequence corresponding to 
15 the region extending from the start site of gene 

transcription to the last base or basepair that 

precedes the initiation codon for protein synthesis. 

3' UTR: The mRNA or gene sequence corresponding to 

the region extending from the first base or basepair 
20 after the termination codon for protein synthesis to 

the last gene-encoded base at the 3' terminus of the 

mRNA. 

(+) strand: Refers to the DNA strand of a gene and Its 
flanking sequences which has the same sense as the RNA 
25 that is derived from that gene. 

(-) strand: Refers to the DNA strand of a gene and its 
flanking sequences that is complementary to the 
(<f) strand. 

30 Since the advent of recombinant DNA 

technology, there has been steady growth in the number 
of systems available for the regulated expression of 
cloned genes in prokaxyotic and exikaryotic cells. One 
eukaryotic system that has gained particularly 

35 widespread use is the baculovlrus ea^ression vector 
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system, or BEVS, developed by Smith and S\immers 
(Bibliography entry 1) . This system utilizes a 
nuclear polyhedrosis virus isolated from the alfalfa 
looper, Autoorapha californica . as a vector for the 
5 introduction and high level e3^res8ion of foreign 

genes in insect cells. 

Autoqrapha californica multicapsid nuclear 
polyhedrosis virus (AcMNPV) is the prototype virus for 
the Family Baculoviridae , These viruses have large, 

10 circular, double- stremded DNA genomes (at least 90-230 

kilobases (2)). There are two Subfamilies, 
Nudibac ulovirinae , which do not form occlusion bodies, 
and the Eubaculovirinae . which are characterized by 
their ability to form occlusion bodies in the nuclei 

15 of infected insect cells. The structural properties 

of the occlusion bodies are used to further classify 
the members of this Subfamily into two genera: the 
nuclear polyhedrosis viruses (NPVs) and the granulosis 
viruses (GVs) . 

20 As exemplified by AcMNPV, the occlusion 

bodies formed by NPVs are 1-3 microns in diameter and 
typically contain several hundred virions embedded in 
a para-crystalline matrix. Occlusion bodies are also 
referred to as either polyhedra (polyhedron is the 

25 singular term) or as polyhedron inclusion bodies 

(PIBs) . The major viral-encoded structural protein of 
the occlusion bodies is polyhedrin, which has a 
molecular weight of 29 kilodaltons (kD) (1,3). More 
than a hundred such occlusions can frecjuently be found 

30 in the nucleus of a single infected cell. GVs are 

distinguished from NPVs by the fact that their 
occlusions are much smaller emd contain only one 
virion, which is embedded in a matrix of the viral 
protein granulin. Nevertheless, the fxindamental 

35 principles of GV replication are similar to those 



described below for AcMNPV. 

Viral occlusion bodies play an essential 
role in the horizontal (insect to insect) transmission 
of Eubaculovirinae > When a larva infected with AcMNPV 
dies, large nuxnbers of occlusion bodies are left in 
the decomposing tissues. In neutral or acidic 
conditions (pH <10) , the protein matrix and outer 
calyx of the occlusion body protect the embedded 
virions against chemical degradation in the 
environment and provide limited protection against UV 
radiation. However/ when the occlusion bodies are 
ingested by another larva, they dissolve rapidly in . 
the larval midgrut, which is strongly alkaline (pH 
10.5-12), and the embedded virions are released. 
These virions then adsorb to and infect various types 
of midgut cells. 

Infected midgut cells synthesize few if any 
new occlusion bodies. Instead, they produce a second 
form of the virus, known as extracellular virus (ECV) . 
Whereas the occluded form of the virus is responsible ' 
for the horizontal transmission of the virus among 
larvae, the ECV is used to spread the infection from 
tissue to tissue internally. This is am essential 
aspect of normal viral pathogenesis and continues 
until most tissues of the larva have been infected and 
lysed. As the virus spreads internally, many of the 
infected cells, especially hemocytes and fat body 
cells, produce not only more ECV, but also copious 
amounts of occluded virus (OV) in the form of 
occlusion bodies. When the larva dies, the occlusion 
bodies are deposited in the environment and the cycle 
begins anew. 

Although ECV and OV are genetically 
identical, they are biochemically distinct. Shortly 
after the AcMNPV infects a cell, the nucleocapsid 



structure (which contains the DMA genome) migrates to 
the nucleus of the cell, where it is \incoated. This 
sets in motion a regulated cascade of viral gene 
expression which leads to the onset of viral DNA 
synthesis (at about 6 hours post-infection (p.i.)) and 
the foannation of many new nucleocapsids. ECV 
production begins at sUdout 10-13 hours p.i, with the 
budding of the nucleocapsids through the cytoplasmic 
surface of the cell. During the budding process, the 
nucleocapsids acquire a lipid membrane, or envelope, 
which is decorated with a viral glycoprotein known as 
gp64. This protein is specific to the ECV form of the 
virus and is required for ECV infectivity. The 
formation of occlusion bodies begins much later (24-36 
hours p.i.) and requires the concerted action of 
numerous specialized viral gene products, the most 
prominent of which is polyhedrin. 

The polyhedrin gene plays a central role in 
the BEVS technology. Because large amounts of 
polyhedrin are required for occlusion body formation, 
the polyhedrin gene is one of the most actively 
transcribed genes in the viral genome during the very 
late phases of virus replication. Smith and Summers 
(1) show that high level expression of a heterologous 
gene can be achieved by sxibstituting the coding region 
of the polyhedrin gene with the coding region of a 
heterologous gene of interest. Since polyhedrin is 
not required for ECV formation, the resulting virus is 
able to replicate normally in cultured insect cells. 
However, it is no longer able to produce polyhedrin 
for occlusion body formation and is therefore 
occlusion-negative (occ') . 

The BEVS has been used successfully to 
express foreign genes isolated from a wide range of 
prokaryotic and eukaryotic organisms and viruses. 



Some representative exaxnples include the human a- and 
j5- interferons, the Drosophila Krueppel gene product, 
E. coli /8-galactosidase, various HIV structural 
proteins, and a Neurospora crassa site -specific DNA 
binding protein (3) • In general, these genes may 
encode cytosolic proteins, nuclear proteins, 
mitochondrial proteins, secreted proteins or membrane- 
boxmd proteins. In most cases, the proteins are 
biologically active and undergo appropriate post- 
translational modification. Including proteolytic 
processing, glycosylation, phosphorylation, 
myristylation and palmitylation. Hence, this system . . . 
has proven to be a highly valued tool for both 
fundamental molecular research and for the production 
of proteins for commercial purposes. 

Using BEVS technology, recombinant viruses 
are produced in cultured insect cells by homologous 
DNA recombination between AcM2IPV DNA and a plasmid- 
based transfer (or transplacement) vector containing 
the heterologous gene of interest under the control of 
the polyhedrin gene promoter. To facilitate 
homologous DNA recombination the modified polyhedrin 
gene of the treuisfer vector is flanked at each end by 
several kilobases (2-4 kb is typical) of native AcMKfPV 
DNA. Many transfer vectors conforming to this general 
specification have been described (4) • 

In a typical experiment, purified AcMNPV DNA 
and transfer vector DNA are mixed together and then 
transfected into Sf9 insect cells. Once the DNA 
reaches the cell nucleus, it can be acted upon by 
cellular proteins involved in the trsoiscription, 
replication, topological management and repair of DNA. 
Most of the viral DNA is used without modification as 
a substrate for viral replication; however, a small 
fraction (typically 0.1-5%) undergoes homologous 



recombination with the transfer vector prior to the 
onset o£ virus replication. The product of this 
recombination event is a virus in which the wild type 
polyhedrin gene has been transplaced by the desired 
heterologous gene of the transfer vector. These 
recraibinant viruses can be identified visually with 
low magnification light microscopy as occ~ plaqpies in i 
standard viral plaque assay. 

An important technical disadvantage of the 
conventional homologous recoxnbination method is the 
poor efficiency of recombinant virus production. 
Recently, two significant modifications to this 
methodology have been described, which are aimed at 
increasing the representation of recombinant viruses 
among the total progeny virus produced in a typical 
CO- transf action experiment. 

Kitts et al. (5) show that linearization of 
AcMNPV viral DNA in the polyhedrin gene region prior 
to co-transf ection with the transfer vector increases 
the percentage of recombinant viruses to about 30-40% 
of the total viral progeny. The rationale for this 
approach is that linearization of the viral DHA 
provides a better substrate for homologous DMA 
recombination and disrupts the biological integrity of 
the viral genome, thereby reducing the background of 
non- recombinant progeny virus. Site-specific 
linearization of the viral DNA is achieved by 
constructing a derivative of AcHNPV that contains a 
single recognition site for restriction endonuclease 
Bsu 361, which does not cut wild type AcMNPV DNA. ' 
This site is introduced into the polyhedrin gene 
region with the aid of a transfer vector in which part 
of the polyhedrin coding region has been replaced by 
the E. coli lacZ gene. The lac Z gene encodes the 
enzyme /3-galactosidase and naturally contains a single 



Bsu 361 recognition site. Since /S-galactosidase can 
be detected in situ by chromogenic assay methods, a 
virus containing the desired Bsu 361 site is isolated 
by plague purification of occViS-gal* recombinant 
viruses • 

More recently, Kitts and Possee (6) further 
refined this approach by constructing an AcMNfPV 
derivative (designated BacPAK6) that contains three 
Bsu 361 sites, one of which lies in an adjacent AcMNPV 
gene designated ORF 1629. This site is created by 
site-directed mutagenesis in such a way that the 
functional integrity of ORF 1629 is preserved in 
undigested DNA. Since ORF 1629 is essential for 
AcMNPV replication, the only way to produce a 
replication- competent virus from Bsu 36I-dige8ted 
BacPAK6 DNA is by homologous recombination with a 
transfer vector that contains at least the missing 
segment of the ORF 1629 gene. As a result, when Bsu 
3 61 -digested BacPAK6 DNA is cotransf acted with a 
standard polyhedrin-based transfer vector (which 
typically includes a functional ORF 1629 gene) , almost 
100% of the viral progeny obtained are recombinants . 

Other strategies, based on the use of 
dominant selectable markers such as the bacterial 
neomycin resistance gene or the apoptosis- inhibiting 
viral p35 gene, have also been described for the 
selective enrichment of recombinant viruses formed by 
homologous recosibination (7) . However, regardless of 
whether this approach or the improvements of Kitts and 
Possee (6) are used to enrich for recombinant viruses, 
the current methodology for recombinant virus 
construction has one inherent limitation: its 
absolute dependence on the use of transfer vectors* 

Transfer vectors, by their very nature, are 
tied to a specific site of gene insertion in the viral 



genome. Hence, a separate transfer vector, and a 
separate screening strategy and/or selection strategy, 
must be developed for each site of insertion. At the 
very least, the reliance on transfer vector-based 
technology requires at least one additional cloning 
step for each site of insertion. This limitation is 
all the more evident as the level of engineering 
sophistication and the potential uses of recombinant 
baculovimses have continued to grow. For example, 
there is currently much interest in exploring the use 
of recombineuit baculoviruses as targeted delivery 
systems for genes that can be used to disrupt the 
normal physiology of specific Insect pests in 
agriculture. The ccnmaercial development of such viral 
insecticides will require the optimization of numerous 
parameters, including not only the choice of critical 
regulatory sequences (promoters, enhancers, 
translational signals and the like) but also the site 
of foreign gene insertion within the viral genome. 

Recently, alternative systems for 
recombinant baculovirus construction in yeast (8) or 
by site- specific DMA recombination in vitro (9) have 
been developed. Both of these systems have the 
disadvcuitage of introducing extraneous DKA into the 
viral genome. Hence, these systems are not well 
suited to applications, such as viral insecticide 
development or vaccine development, where it is not 
desirable to incorporate any extraneous DMA segments 
(e.g., plasmld vector sequences) into the final 
recombinant virus. 

The current state of the art has given rise 
to the prevailing opinion that has dominated the 
conduct of recombinant baculovirus research, which is 
as follows; "In the past, one of the disadvantages of 
these [baculovirus escpresslon] systems was the work 
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involved in isolating a recombinant baculovirus 
expression vector. The viral genome is too large (130 
kb) to manipulate directly, consequently, the standard 
method for producing virus eaqpression vectors has been 
to co-trans£ect insect cells with viral DNA and DNA of 
a trsmsfer vector modified to incorporate the foreign 
gene" (6) . 

Summary Of The Invention 

It is an object of the present invention to 
construct recombinant double stranded DNA insect , . 
viruses, such as baculoviruses, by ligation of DNA 
fragments in vitro > 

It is a further object of this invention to 
provide a means of efficiently inserting a linear DMA 
fragment into a double stranded DMA insect viral 
genome without the use of an intermediate transfer 
vector. * ' 

It is an additional object of this invention 
to provide for the ligation in vitro of a DMA fragment 
into a doxible stranded DNA insect viral genome in a 
predefined orientation. 

It is still another object of this invention 
to provide for a modular expression vector containing 
a virus insertion module with a nucleic acid sequence 
encoding a heterologous protein, where said virus 
insertion module is ligated in vitro into a double 
stranded DMA insect virus. 

It is yet another object of this invention 
to provide for a modular expression vector which 
facilitates the ready substitution of promoters, 
heterologous signals, heterologous genes, and other 
regulatory sequences in the virus insertion module. 

These objects of this invention are achieved 
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through the construction of a reconbinsuit double 
stranded DNA Insect virus obtained from a double 
stranded DNA insect virus into which is inserted at 
least one recognition site for a restriction 
endonuclease which does not cut the viral genome to 
create a direct ligation virus vector; and a DNA 
fragment containing termini selected such that when 
the direct ligation virus vector is cleaved by the 
appropriate restriction endonuclease (s) , the DNA 
fragment is ligated in vitro into the direct ligation 
virus vector. To ligate the DNA fragment into the 
direct ligation virus vector in a predefined 
orientation, two different restriction endonuclease 
recognition sites are inserted into the genome of the 
double stranded DNA insect virus. It is a preferred 
einbodiment of this invention wherein the DNA fragment 
is ligated into a region of the insect viral genome 
which is nonessential for viral replication in 
cultured cells. 

In a particularly preferred embodiment of 
this invention, the DNA fragment is contained in a 
modular expression vector. In turn, the modular 
expression vector comprises a plasmid vector 
containing a virus insertion module which con^rises, 
in the following order, a recognition site for a 
restriction endonuclease; a promoter module containing 
a promoter and a 5' untranslated region (UTR) , where 
the 5' UTR extends from the transcription start site 
to the last base pair which precedes the translation 
initiation codon for protein synthesis; a polylinker 
module to facilitate insertion of a heterologous gene; 
a 3' UTR module containing at least a site for 3' 
terminal mRNA processing and polyadenylation; and a 
recognition site for a restriction endonuclease, such 
that the two recognition sites permit the ligation in 
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vitro of the virus Insertion module Into a direct 
ligation virus vector In a predefined orientation. 

In a further embodiment of the modular 
expression vector of this Invention, the polylinker 
module of the virus insertion module is altered by the 
insertion of a nucleic acid sequence encoding a 
heterologous protein. In a particularly preferred 
embodiment of this invention, the nucleic acid 
sequence encoding a heterologous protein is selected 
from the group consisting of heterologous genes coding 
for an Insect controlling or modifying substsmce. The 
modular expression vector may further coxigprlse a 
nucleic acid sequence encoding a heterologous signal 
sequence located immediately upstream of the nucleic 
acid sequence encoding a heterologous protein. 

The virus insertion module is ligated in 
vitro into a direct ligation virus vector. The direct 
ligation virus vector comprises a double strsuided DNA 
insect virus into which is inserted at least one 
recognition site for a restriction endonuclease which 
does not cut the DNA genome of the virus, such that 
when the direct ligation virus vector is cleaved by 
the appropriate restriction endonuclease, a DNA 
fragment (such as the virus insertion module from a 
modular expression vector) is ligated in vitro into 
the direct ligation virus vector to produce a 
recombinant double stranded DNA insect virus. 



Brief Description Of The PicRires 

Figure 1 depicts a linear map of some known 
baculovirus genes of the AcMNPV genome. 

Figure 2 depicts trinucleotide freqpiencies 
of a portion of AcMNPV DNA. 
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Figure 3 depicts tetranucleotide frequencies 
of a portion of AcMNPV DNA. 

Figure 4 depicts the plasmid designated 
MW33.2, which contains a Bsu-Sse linker inserted into 
5 a unique Eco RV site of the AcMNPV Eco RI "I" 

fragment. The nucleotide sequence of the 40 base pair 
(bp) linker is also depicted. 

Figure 5 depicts the general scheme for 
assembling the Ap74-1 transfer vector used to assemble 
10 a p74 -deficient direct ligation virus vector. Figure 

5 also depicts the nucleotide sequence of the portion 
of the transfer vector containing residues 1-69 
(beginning at the ATG start codon) of the p74 open 
reading frame, a 64 bp polylinker (which includes the 
.15 Bsu-Sse linker) and residues 1287-1937 of the p74 open 

reading frame (to the TAA termination codon) • The 
leftward and rightward arrows denote the positions of 
the oligonucleotides used for the PCR-based 
identification of the p74 -deficient direct ligation 
20 virus vector. 

Figure 6 depicts a portion of a modular 
expression vector with Bsu 361 and Sse 83871 sites at 
opposite ends of an expression cassette containing a 
promoter module, a polylinker module and a 3' UTR 
25 module. The polylinker module contains a Bsp MI 

recognition site. The region bounded by the outermost 
Bsu 361 and Sse 83871 sites is defined as the virus 
insertion module . 

Figure 7 depicts the scheme for constructing 
30 the vector NW46.50, which is a Bsp Ml-based modular 

expression vector containing the AcMNPV 6.9K gene 
promoter and 3' UTR. 

Figure 8 depicts the nucleotide sequences of 
primers designated NW oligo 1 and PD oligo 23, 
35 together with the nucleotide secpxence of the fragment 
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asoplified from the AcMNPV Hind III fragment "H" 
teinplate. Positions are numbered relative to the 
start site of 6.9K gene translation, which is assigned 
position 4-1. Sequences deriving from the synthetic 
DNA primers are underlined. Nonviral polylinker 
sequences in the as^lified DNA fragment are shown in 
lower case letters. 

Figure 9 depicts the nucleotide sequences of 
primers designated NW oligo 2 and NW oligo 3, together 
with the nucleotide sequence of the fragment amplified 
from the AcMNPV Hind III fragment "H" template. 
Sequences deriving from the synthetic DNA primers are 
underlined. The asq>lified sequences shown in 
uppercase letters correspond to the complete 3' UTR of 
the AcMNPV 6.9K gene. Nonviral polylinker sequences 
in the asiplified DNA fragment are shown in lower case 
letters. 

Figure 10 depicts a portion of a modular 
expression vector with £su 361 and Sse 83871 sites at 
opposite ends of an expression cassette containing a 
promoter module, a polylinker module and a 3' UTR 
module. The polylinker module contains an Ssp 31 
recognition site. The region bounded by the outermost 
Bsu 361 and Sse 83871 sites is defined as the virus 
insertion module. 

Figure 11 depicts the nucleotide sequences ' 
of primers designated DA26FZ and DA26RZ, together with 
the nucleotide sequence of the fragment amplified from 
the AcMNPV Pst I fragment "G" template. Positions are 
ninnbered relative to the start site of DA26 gene 
translation, which is assigned position +1. Sequences 
deriving from the synthetic DNA primers are 
underlined. Nonviral polylinker sequences in the 
asxpllfied DNA fragment are shown in lower case 
letters . 
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Figure 12 depicts the nucleotide sequences 
o£ primers designated 35KPR01 and 35KPR02, together 
with the nucleotide sequence of the fragment amplified 
from the AcMNPV Hind III fragment "K" texqplate. 
Positions are numbered relative to the start site of 
35K gene translation, which is assigned position +1. 
Sequences deriving from the synthetic DMA primers are 
underlined. Nonvlral polyllnker sequences in the 
amplified DNA fragment are shown in lower case 
letters . 

Figrure^lS depicts the nucleotide sequences 
of primers designated 69KFZ and 69KRZ, together with, 
the nucleotide sequence of the fragment amplified from 
the AcMNPV Hind HI fragment "H" template. Positions 
are numbered relative to the start site of 6.9R gene 
translation, which is assigned position -i-l. Sequences 
deriving from the synthetic DNA primers are 
underlined. Nonviral polyllnker sequences in the 
amplified DMA fragment are shown in lower case 
letters. ' 

Figure 14 depicts the nucleotide sequences 
of primers designated FHF and PER, together with the 
nucleotide sequence of the fragment amplified from the 
AcMNPV Hind III fragment "F" tes^late. Positions are 
numbered relative to the start site of polyhedrin gene 
translation, which is assigned position +1. Sequences 
deriving from the synthetic DNA primers are 
underlined. Nonvlral polyllnker sequences in the 
aispllfled DNA fragment are shown in lower case 
letters. 

Figure 15 depicts the nucleotide sequences 
of primers designated DMHSP70F and DMHSP70R, together 
with the nucleotide sequence of the fragment aizplifled 
from the Drosophlla melanoaaster locus 87C1. 
Positions are numbered relative to the start site of 
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hsp70 gene translation, which is assigned position -i-l. 
The vertical arrow at position -242 marks the major 
transcription start site of the hsp70 gene. Sequences 
deriving from the synthetic LWl primers are 
underlined. Polylinker sequences in the as^lified DNA 
fragment are shown in lower case letters • 

Figure 16 depicts the nucleotide sequences 
of primers designated 35KHR5A and 35KHR5B, together 
with the nucleotide sequence of the fragment amplified 
from the AcMNPV Hind III fragment "Q" teinplate. The 
notation "35K TER" marks the termination codon for the 
35K gene. The vertical arrow marks the site of ; , 
poly (A) addition for the 35K gene. The notation 
"EcoRI" marks the position of the characteristic IR24 
repeat element of hrS. Sequences deriving from the 
synthetic DNA primers are underlined. Nonviral 
polylinker sequences in the amplified DMA fragment are 
shown in lower case letters. 

Figure 17 A depicts the polymerase chain 
reaction (PGR) strategy for the anqplif ication of a 
cuticle signal/codon optimized AalT gene, which is 
then digested with Bam HI. Figure 17B depicts the ^ 
nucleotide sequences of primers designated PD oligo 31 
and PVLReverse, which are used to amplify sequences 
from a plasmid designated pAC0055.1, as well as the 
nucleotide and amino acid sequences of the cuticle ' 
signal and AalT in the amplified product. Sequences 
deriving from the synthetic DNA primers are 
underlined. 

Figure 18 depicts a schematic representation 
and the complete predicted nucleotide sequence of a 
portion of a modular expression vector (AC0076.1) 
formed by inserting the cuticle signal/codon optimized 
AalT into pMEVl, which contains the AcMNPV DA26 
promoter • 
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Detailed Description Of The Invention 

This Invention is directed to the 
construction of a recombinauit double stranded DMA 
insect virus obtained from a double stranded DNA 
insect virus into which is inserted at least one 
recognition site for a restriction endonuclease which 
does not cut the DNA genome of the virus to create a 
direct ligation virus vector; and a DNA fragment 
containing termini selected such that when the direct 
ligation virus vector is cleaved by the appropriate 
restriction endonuclease (s) , the DNA fragment is 
ligated in vitro into the direct ligation virus 
vector . 

The double stranded DNA insect viruses which 
are modified in accordance with this invention include 
double stranded enveloped DNA viruses such as 
(Subfamily, then species) Entomopoxvirinae ( Melolontha 
melolontha entomopoxvirus) , Exibaculovirinae 
( Autoarapha califomica MNPV; Heliocoverpa zea NPV; 
Trichopluaia ni GV) , Nudibaculovirinae ( Heliocoverpa 
zea NOB) , Ichnovirus ( Campoletls sonorensis virus) / 
and Bracovirus ( Cotesia melanoscela virus) # as well as 
double stranded nonenveloped DNA viruses such as the 
Family Iridoviridae ( Chilo iridescent virus) . These 
insect viruses typically have genomes at least 90 kb 
in size (2) • 

Over 400 baculovirus isolates have been 
described. The Subfamily of double stranded DNA 
viruses Eubaculovirinae includes two genera ^ nuclear 
polyhedrosis viruses (NPVs) and granulosis viruses 
(GVs) , which are particularly useful for biological * 
control because they produce occlusion bodies in their 
life cycle. Exaixrples of NPVs include Lvmantria dispar 
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NPV (gypsy moth NPV) , Autoarapha califonxica MNPV, 
Anaorapha falcif era NPV (celery looper NPV) , 
Spodoptera littoralls NPV, Spodoptera fruaiperda NPV, 
Heliothis armiaera NPV, Mamestra brassicae NPV, 
ChorlBtoneura fumlferaQa NPV, Trichopluaia ni NPV, 
Heliocoverpa zea NPV, Rachlplusia ou NPV, etc. 
Examples of GVs include Cvdia pomonella GV (coddling 
moth GV) , Pieris brassicae GV, Trichoplusia ni GV, 
Artoaeia rapae GV, Plodia interpunctella GV (Indian 
meal moth) , etc . Exanqples of entomopox viruses 
include Melolontha melolontha EPV, Amsacta moorei EPV, 
Locus ta miaratoria EPV, Melanoplus sanauinipes EPV, 
Schistocerca areaaria EPV, Aedes aecrvrpti EPV, 
Chironomus luridus EPV, etc. 

The Autoqrapha califomip^ nuclear 
polyhedrosis virus (AcMNPV) is the prototype virus of 
the Family Baculoviridae and has a wide host range. 
The AcMNPV virus was originally isolated from 
Autoqrapha californica, a lepidopteran noctuid (which 
in its adult stage is a nocturnal moth) , commonly 
known as the alfalfa looper. AcMNPV has an 
approximately 130 kb genome (10) • This virus infects 
12 families and more than 30 species within the order 
of Lepidopteran insects (11) . 

Although the invention will be 
exemplified for Autoqrapha californica NPV (AcMPNV) , 
it is xinderstood that the concepts described herein 
are applicable for all the above- listed insect 
viruses. It is further contemplated that the present 
invention will be highly useful in improving new 
insect viruses which are not yet identified and 
classified in the literature. 

In the first step in carrying out this 
invention, a double stranded DNA insect virus is 
selected and at least one recognition site for a 
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restriction endozxuclease (s) which does not cut the DK& 
genome of the virus is inserted into the viral genome 
to create a direct ligation virus vector. The insect 
virus which will be used to construct the direct 
5 ligation virus vector can be either a native virus 

which lacks the recognition site being inserted or a 
virus which is first modified to destroy any such 
preexisting recognition sites* 

Subsequently, a DNA fragment containing 

.10 termini selected such that when the direct ligation 

virus vector is cleaved by the appropriate restriction 
endonucleaBe(s) , the DNA fragment is ligated in vitro . ; 
into the direct ligation virus vector to produce a 
recombinant double stranded D29A insect virus. As will 

15 be discussed below, inserting two different 

recognition sites into the direct ligation virus 
vector permits the insertion of a DMA fragment in a 
predefined orientation. 

It is known that there are no recognition 

20 sites for the restriction endonuclease Bsu 361 in the 

genome of the C6 strain of AcMNFV (5) . Applicants 
have also discovered that this site is also not 
present in the genome of the E2 strain of the virus. 
Furthermore, Applicants have discovered that there are 

25 no recognition sites for the restriction endonuclease 

Sse 83871 in the genome of AcMNPV strain E2. It is 
also known that there are no recognition sites for the 
restriction endonuclease Sma I in the genome of ' 
Spodoptera f ruaiperda NPV (12) . 

30 Other recognition sites lacking in these 

viruses, as well as other viruses, are identified by 
first analyzing tri- emd tetranucleotide sequence 
frequencies represented in sequenced viral DMA. Codon 
frequency tables are constructed for each viral 

35 genome, such as the tables for portions of AdMNPV 
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depicted in Figures 2 and 3 . The frequency data is 
used to estimate the likelihood that other enzymes, 
including those with non-palindromic recognition 
sequences, might be useful for introducing unique 
restriction sites into the viral genome. The 
available viral DNA sequences are then searched for 
the recognition sequences of the candidate enzymes. 
If no such sequences are found, the candidate enzyme 
is tested for its inability to cleave the viral 
genome. If the enzyme cannot digest the viral genome, 
it can be used for the construction of a direct 
ligation virus vector. 

Even if sequence information is lacking for 
a given virus, known secpience information in related 
15 viruses is used to identify candidate enzymes. For 

exaxE^le, the trinucleotide frequencies listed in 
Figure 2 for AcMNPV suggest that the recognition site 
for Sma I (CCCGOG) should occur infrequently in the 
AcMNPV genome. While there are four such sites in 
AcMNPV, there is only one such site in Spodoptera 
littoralis NPV and none in Spodoptera fruqiperda NPV. 

To ligate the DNA fragment into the direct 
ligation virus vector in a predefined orientation, two 
different restriction endonuclease recognition sites 
are inserted into the doxible stranded DNA insect viral 
genome. In a preferred embodiment of this invention, 
the DNA fragment is inserted at any region of the 
viral genome of the direct ligation virus vector which 
is nonessential for viral replication in cultured 
cells. Exaniples 1-3 below describe the construction 
of two different direct ligation AcMNPV viral vectors 
where the DNA fragment will be inserted in a 
nonessential region . 

In Examples 1 and 2, unique Bsu 361 and Sse 
35 83871 recognition sites are engineered into a viral 
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construct at the \inlgue Eco RV restriction enzyme 
recognition site present in the AcMNPV EcoRI "I" 
fragment. This Eco RV site is located 92 bp upstream 
of the trauislation initiation codon of the AcMNPV 
polyhedrin gene. The direct ligation virus vector is 
obtained by homologous DNA recombination in cultured 
insect cells cotransf ected with the transfer vector 
containing the Bsu 36I/S6e 83871 sites and viral DNA 
modified by substitution of part of the polyhedrin 
gene so as to be unable to produce occlusion bodies. 
Because the transfer vector contains a functional 
polyhedrin gene, homologous recombination yields 
occlusions-positive viruses which are readily isolated 
by plague purification. One such virus is the direct 
ligation virus vector designated 6.2.1. 

In Example 3, unique Bsu 361 and Sse 83871 
recognition sites are engineered into the p74 gene 
region of a transfer vector designated Ap74-1. The 
direct ligation virus vector designated A4000 is 
obtained by homologous recombination in cultured 
insect cells cotransf ected with the Ap74-1 transfer 
vector and wild- type AcMNPV (strain E2) viral DNA. ' 
Although A4000 is still able to replicate, this 
disruption of the integrity of the p74 open reading 
frame results in the formation of a p74«-defective 
virus which is unable to produce orally infectious 
occlusion bodies (13) . 

The direct ligation virus vector is ligated 
in vitro to a DNA fragment to produce the recombinant 
double stranded DNA insect virus. The termini of the 
DNA fragment should be cosgpatible with the termini 
formed by the digestion of the direct ligation virus 
vector with the one or two uniquely cutting 
restriction enzymes. Compatible termini are those 
which are readily joined by DNA ligase in vitro; 
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Alternatively, the DNA termini can be modified, for 
exaii5)le, by filling in one or more bases to make their 
ends compatible. 

In a particularly preferred embodiment of 
5 this invention, the DNA fragment is contained in a 

modular expression vector. In turn, the modular 
esqpression vector cosqprises a plasmid vector 
containing a virus insertion module which comprises, 
in the following order, a recognition site for a 

10 restriction endonuclease; a promoter module containing 

a promoter and a 5' untranslated region (UTR) , where 
the 5' UTR extends from the transcription start site 
to the last base pair which precedes the translation 
initiation codon for protein synthesis; a polylinker 

15 module to facilitate insertion of a heterologous gene; 

a 3' UTR module containing at least a site for 3' 
terminal mRHA processing and polyadenylation; and a 
recognition site for a restriction endonuclease, such 
that the two recognition sites permit the ligation in 

20 vitro of the virus insertion module into a direct 

ligation virus vector. 

Typically, the virus insertion module is 
contained in a plasmid vector. For use in direct 
ligation, the plasmid vector is digested with the 

25 unique restriction endonuclease (s) to excise the virus 

insertion module, which is then inserted by ligation 
in vitro into a direct ligation virus vector (such as: 
AcMNPV strains 6.2.1 or A4000) previously digested 
with the same unique restriction endonuclease (s) . 

30 Examples 4-8 below describe the construction of a 

variety of modular expression vectors containing a 
virus insertion module. 

Examples 9 and 10 describe the ligation in 
vitro of a DNA fragment from the modular expression 

35 vector NW44.1 into 6.2.1 viral DNA. The two 
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coii«>onents are each linearized by digestion with Bsu 
361 and Sse 83871 and the small fragments are 
discarded. The linearized components are then ligated 
with T4 DNA ligase. The resulting recombinant virus 
contains the desired insert from the DNA fragment of 
NW44.1, which indicates that the direct ligation 
procedure is successful. 

Turning to the specific elements of the 
virus insertion module, the recognition sites at the . 
5' and 3' ends are ae described previously. The 
promoter module contains a promoter and a 5' 
untranslated region (5' UTR) , where the 5' UTR extends 
from the transcription start site to the last base 
pair which precedes the translation initiation codon 
15 for protein synthesis. For use with AcMHPV, a variety 

of AcMMPV promoters and 5' TJTR are used, including the 
"late" 6.9K promoter, the "early" DA26 and 35K 
promoters, and the "very late" polyhedrin promoter, 
in some cases, it may be advantageous to use an non- 
viral promoter rather than a viral promoter. An 
example of such a non-viral promoter is the Drosophil^ 
n>>.lMioaaster hsp70 (major heat shock) gene promoter, 
synthetic promoters and chimeric promoters are also 
used. 

It is not necessary that a complete 5' UTR 
be included in the promoter module. It is merely 
required to have a sufficient portion of the 5' UTR to 
allow translation of the heterologous gene in the 

polylinker module. 

The polylinker module serves as the 
framework into which a heterologous gene is inserted. 
The insertion of a nucleic acid sequence encoding a 
heterologous protein necessarily alters the polylinker 
module, but does not alter the structure or function 
35 of the promoter module. Any heterologous gene which 
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contains an open reading frame flanked by in- frame 
translation initiation and termination codons may be 
inserted into the polylinker module. Therefore, all 
the heterologous genes which have been expressed in 
the BEVS are also es^ressible in accordance with this 
invention. 

In a particularly preferred embodiment of 
this invention, the nucleic acid sequence encoding a 
heterologous protein is selected from the group 
consisting of heterologous genes coding for an insect 
controlling or modifying substance. The insect 
controlling or modifying substance is selected from . 
the group consisting of toxins, neuropeptides and 
hormones, and enzymes. 

Such toxins include a toxin from the mite 
species Py emotes tritici (14) , the toxin AalT from 
Androctonup australis (15) , a toxin isolated from 
spider venom (16), a toxin from Bacillus thurinoienais 
subsp. aizawal (17), and a toxin from Bacillus 
thurinaiensis CrylVD (18) . Such neuropeptides and 
hormones include eclosion hormone (19) , 
prothoracicotropic hormone, adipokinetic hormone, 
diuretic hormone and proctolin (20) . An example of an 
enzyme is juvenile hormone esterase (21) . 

Although the invention will be exemplified 
for AalT, it is understood that the concepts described 
herein are applicable for all the above-listed insect 
controlling or modifying s\abstances, as well as for 
other heterologous proteins. The native nucleotide 
sequence for the gene encoding AalT (SEQ ID NO: 45) may 
be used. However, a modified nucleotide seG[uence may 
also be used. 

The degeneracy of the genetic code permits 
variations of the nucleotide sequence, while still 
producing a polypeptide having the identical amino 
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acid secjuence ae the polypeptide encoded by the native 
DNA sequence. The procedure known as codon 
optimization provides one with a means of designing 
such an altered DNA sequence. 

The design of codon optimized genes should 
take into accoTint a variety of factors. Including the 
frec[uency of codon usage in an organism, nearest 
neighbor frequencies, RNA stability, the potential for 
secondary structure formation, the route of synthesis 
and the Intended future DNA manipulations of that 
gene. One such codon optimized AalT nucleotide 
sequence is that set forth in SEQ ID MO:29, 
nucleotides 49-258. 

In some cases, the heterologous gene 
inserted into the polylinker module may include a 
nucleotide sequence encoding a signal peptide. Signal 
sequences are required for a cos^lex series of post- 
translational processing steps which result in 
secretion of a protein. If an intact signal sequence 
is present, the protein being expressed enters the 
lumen of the rough endoplasmic reticulum and is then 
transported through the Oolgi apparatus to secretory 
vesicles and is finally transported out of the cell. 
Generally, the signal sequence immediately follows the 
initiation codon and encodes a signal peptide at the 
amino- terminal end of the protein to be secreted, in 
most cases, the signal secpience is cleaved off by a 
specific protease, called a signal peptidase. Signal 
sequences ixnprove the processing and export efficiency 
of recombinant protein expression using viral 
expression vectors. Where the heterologous protein is 
an insect controlling protein, optimized expression of 
the insect controlling protein using an appropriate 
signal sequence achieves more rapid lethality than 
wild- type insect virus. 



If the native AalT gene is used, it 
typically will be immediately downstream of the native 
AalT signal peptide, which is encoded by the 
nucleotide sequence of SEQ ID NO:28. However, it is 
possible to use a heterologous signal peptide, 
particularly a signal peptide from an insect species. 
Seven such insect signal peptides are as follows 
(listed by type, species, codon optimized and native 
sequence identification nmnbers) : the cuticle signal 
sequence from Drosophila melanoaaster (SEQ ID N0S:29, 
nucleotides 1-48; 40) , the chorion signal sequence from 
Bombvx mori (SEQ ID NOS:38,39}, the apolipophorin 
signal sequence from Manduca sexta (SEQ ID NOS:36,37), 
the sex specific signal sequence from Bombyx mori (SEQ 
ID 1105:43,44} , the adipokinetic hormone signal 
sequence from Manduca sexta (SEQ ID N0S:34,35}, the 
pBMHPC-12 signal sequence from Boanbvx mori (SEQ ID 
NOS:32,33) and the esterase-6 signal sequence from 
Drosophila melanoaaster (SEQ ID NOS:41,42) • 

In Exaa^le 11 below, fragments derived from 
various modular expression vectors containing the gene 
for AalT are ligated in vitro into the direct ligation 
virus vectors 6.2.1 and A4000. Seven of the eight 
resulting recombinant viruses test positive by PGR for 
the formation of derivatives of 6.2.1 and A4000, 
indicating that the direct ligation is successful. 
Further confirmation is provided by injection and oral 
feeding bioassays with Heliothts virescene larvae. 
Greater than 95% of all responding larvae infected 
with viruses containing the AalT gene exhibit 
contractile paralysis prior to death. Moreover, all* 
of the recombinant AalT viruses have a shorter mean 
response time (RT50) than wild type AcMNPV (strain E2) • 
Further engineering of the polyl inker module 
is also provided herein. The insertion of an 
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additional restriction endonuclease recognition site 
in the virus insertion module facilitates the 
"perfect" fusion of the translation start site of the 
heterologous gene with the 3' terminus of the promoter 
module. Specifically, the additional site is located 
an appropriate distance downstream of the 3' terminus 
of the 5' UTR and the restriction endonuclease is of 
the type which cuts both strands of DJSUi at sites which 
lie outside of its recognition site. 

The position and orientation of the 
recognition site are set, such that when the digestion 
products are treated with the Klenow fragment of E. 
£oli DNA polymerase I in the presence of dNTPs, one 
DNA end produced by this process corresponds exactly 
to the 3' end of the promoter module. This fragment 
is then joined directly to the 5' end of the 
heterologous gene without the introduction of 
extrameous linker sequences between the 3' terminus of 
the 5' UTR and the translation initiation codon. Two 
such recognition sites are those for the restriction 
endonucleases Bsp MI (Figure 6) and Esp 31 (Figure 
10) . 

The 3 ' UTR module of the virus insertion 
module contains at least a site for 3' terminal mRKA 
processing and polyadenylation. This module may also 
contain other types of regulatory sequences, such as 
enhancers. Examples of suitable 3' UTRs include the 
AcMNPV 6.9K 3' UTR and the 3' terminus of the AcMNPV 
35K gene with the AcMNPV homologous region 5 (hrS) . 
The region hr5 e nh ances the transcriptional activity 
of the "early" 35K gene and may be useful for other 
genetic constructs (22) . 

Yet further engineering of the virus 
insertion module is part of this invention. A Stu I 
restriction endonuclease recognition site is inserted 
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between the recognition site at the 5' end of the 
virus insertion module and the 5' end of the promoter 
module. A second Stu I restriction endonuclease 
recognition site is inserted between the 3' end of the 
3' UTR and the recognition site at the 3' end of the 
virus insertion module. If the nucleic acid sequence 
encoding a heterologous protein in the polylinker 
module does not contain a Stu I restriction 
endonuclease recognition site, then the orientation of 
the expression cassette within the virus insertion 
module can be reversed by digestion with Stu I, 
followed by religation of the digestion products. 
Exaa^le 4 below describes such dual orientation 
constructs • 

The foregoing discussion has been focused on 
the insertion of a virus insertion module containing a 
single heterologous gene into a direct ligation virus 
vector. However, it is also within the scope of this 
invention that the virus insertion module can contain 
two or more expression cassettes, each with its own 
promoter module, heterologous gene and 3' UTR, The 
promoter and 3' UTR used for each gene can be the same 
or different, in order to control the level and timing 
of expression for each heterologous gene. Such virus 
insertion modules are used to create recombinant 
viruses which express a multitude of heterologous 
proteins from a single virus insertion site. 
Alternatively, each expression cassette is assembled 
in an independent virus insertion module, each of 
which is then inserted at a different unique site 
within a single direct ligation virus vector, such 
that the recombinant virus expresses a multiplicity of 
heterologous proteins. 

In summary, this invention provides for the 
direct ligation in vitro of foreign DNA into double 
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stranded DNA insect viral genomes* No transfer 
vectors are needed. No extraneous DNA sequences, such 
as plasmid vectors, need be introduced. Fragments 
derived from the modular expression vectors are 
ligated ia vitro into the direct ligation virus 
vectors. The modular expression vectors provide for 
ready substitution of promoters, heterologous signals, 
heterologous genes and other regulatory sequences in 
the virus insertion modules. The frequency of 
recovery of the recombinant insect viruses is 
typically in the range 25-100%. This high rate 
facilitates screening for the desired constructs. 
Purification is simplified because one plaque 
purification step is eliminated. Overall, the 
invention facilitates the assembly and construction of 
recombinant double struded DNA insect viruses. 

In order that this invention may be better 
understood, the following examples are set forth. The 
examples are for the purpose of illustration only and 
are not to be construed as limiting the scope of the 
invention. 

Examples 

Unless otherwise noted, standard molecular 
biological techniques are utilized according to the 
protocols described in Sambrook et al. (23). Standard 
techniques for baculovirus growth and production are 
utilized according to the protocols described in 
Summers and Smith (10) . All references to "named" 
AcMNPV restriction fragments are based on the physical 
maps of the E2 strain of AcMNPV published in Summers - 
and Smith (10) . For example, the designation Eco RI 
"I" refers to the fragment identified as "I" on the 
linear map of restriction endonuclease fragments 
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produced by digestion of AcMNPV strain E2 DNA with Eco 
RI (Figure .1) . 

Example 1 

Construction of Transfer Vee tor NW33,2 

The concept of constructing recombinant 
baculovirus genomes by the directional ligation of 
viral and foreign DNA segments in vitro is predicated 
on (1) the identification of two or more restriction 
endonucleases that fail to cut the DNA genome of the 
baculovirus under study, and (2) the insertion of a 
synthetic linker containing these sites into a defined 
location in the baculovirus genome by conventional 
methods. In some cases, it may be possible to use an 
enzyme which cuts the viral genome infrequently if the 
recognition site{s) can be destroyed by site-directed 
mutagenesis. Optimally, the restriction enzymes 
should not only have different recognition sequences, 
but should also (upon cleavage of a susceptible DNA 
molecule) produce termini that are not readily joined 
to each other by T4 DNA ligase in vitro. These 
criteria ensure that the inserted DNA fragment will be 
joined to the viral DNA in a predefined orientation. 
Visnises which have been genetically modified to 
incorporate these special design features are referred 
to as direct ligation virus vectors. 

Two restriction endonucleases are known to 
meet these criteria for the Autoaranha califomica NPV 
(AcMNPV) strain E2 genome. Kitts et al. (5) show that 
there are no recognition sites for restriction 
endonuclease Bsu 361 in the DNA genome of the C6 
strain of AcMNPV. This site is also not present in 
the DNA genome of AcMNPV strain E2. 

Bsu 361 recognizes and cleaves (4) the DNA 
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sequence CCi^TNAGG. Those skilled in the art will 
recognize that n\unerous isoschizomers of Bsu 361 are 
known, including Eco 811, Mst II, Cvn I and Sau I, to 
name a few. As shown in Figure 2, the Bsu 361 
recognition sequence contains two of the four least 
ab\mdant trinucleotide sequences (AG6 and OCT) found 
in a portion of AcMNPV DNA* Moreover, based on this 
frequency data one would expect that palindromic 
restriction sites based on the CCT and A6G 
trinucleotides (e.g., the Bsu 361 site) would be among 
the least abundant in the AcMNPV genome. A similar 
analysis of tetranucleotide sequence frequencies 
(Figure 3) can be used to identify potential non- 
cutting enzymes among those restriction endonucleases 
that recognize 8 bp palindromic secpiences. Of the 
five predicted least abundant classes of palindromic 
8-mers, only one class (built on the CCTG and CKQQ 
tetranucleotides) contains the recognition site of a 
known restriction endonuclease. This enzyme, Sse ' 
83871, recognizes the sequence CCTGCA^GG and is the 
second known enzyme which does not cut the genomic DNA 
of AcMNPV strain E2. Interestingly, the recognition 
secpience for Sse 83871 also has the same general 
sequence pattern (CCT [NJ AGO) as Bsu 361 (where x=:l-5) . 
Those skilled in the art will recognize that the 
frequency data presented in Figures 2 and 3 can be 
used to estimate the likelihood that other enzymes, ^ 
including those with non- palindromic recognition 
sequences, might be useful for introducing unique 
recognition sites into the viral genome. 

Based on the identification of Bsu 361 and' 
Sse 83871 as enzymes that can be used to prepare 
direct ligation virus vectors for the E2 strain of 
AcMNPV, two complementary oligonucleotides (oligo 32. 
and oligo 33} containing the recognition sites for 
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these enzymes are chemically synthesized and 
incorporated into transfer vectors that can be used to 
insert the sites by a conventional method into 
predefined locations in the AcMNPV genome. The 
5 sequences of these oligonucleotides are: 



Oligo 32: 5' -CCTC»LG6GCil6CTTAA66CA6Ci3GACC66CAGCCT6CAG6-3 
(SEQ ID N0:1) 

Oligo 33: 5' -CCT6CA6GCT6CCG6TCC6CT6CCTTAA6CT6CCCTGA66-3 
10 (SEQ ID NO: 2) 



In their doxible- stranded (annealed) 
configuration, these two oligonucleotides constitute 
the "Bsu-Sse linker". To anneal the linker, 100 pmol 

15 each of oligonucleotides 32 and 33 are diluted into 20 

1x1 of a buffer containing 10 mM Tris-HCl (pH 7.5), 100 
mM NaCl and 1 lAM EDTA. The mixt\ire is then heated to 
78^0 for 10 minutes, transferred to 65^C and incubated 
for 20 minutes, and then slow cooled to room 

2 0 temperature . 

One site in which the Bsu-Sse linker is 
inserted is the Eco RV restriction enzyme recognition 
site (GATiATC) located 92 bp upstream of the 
translation initiation codon of the AcMMPV polyhedrin ^ ' 

25 gene. This site is frequently used as a site for 

foreign gene insertion to produce recombinant virusek ' 
that are genetically stable and capable of producing 
large quantities of recombinant protein (24,25) . To 
construct a plasmid in which the Eco RV site of 

30 interest is the only Eco RV site present, the 

approximately 7 kb AcMNPV Eco RI "I" fragment, which 
contains the polyhedrin gene, is inserted into the 
unique Eco RI site of pUC19. One of the resulting 
clones identified as containing the Eco RI 

35 fragment is designated NW32.3. To insert the Bsu-Sse 
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linker into the xinique Eco RV site of NW32.3, 0.3 pmol 
of MW32.3 DMA are linearized by digestion with Eco RV 
and ligated to five pmol of the non-phosphorylated 
double -stranded linker in a volume of 10 microliters. 
Excess linker is removed by electrophoresing the 
ligation mixture on a 1% low melt agarose (BioRad, 
Richmond, CA) gel and isolating the 10 kb DNA band. 
Approximately one- tenth of this DNA is used to 
transform competent E. coli strain HBlOl cells. One 
of the resulting plasmids, designated NW33.2, is 
sequenced to confirm the integrity and orientation of 
the Bsu-Sse linker (Figure 4) • 



Example 2 

Construction of the Direct Ligation 
Virus Vector 6.2.1 

A reconibinant virus containing the Bsu-Sse , 
linker at the Eco RV site upstream of the polyhedrin 
gene is produced by homologous DMA recombination in 
cultured Sf9 cells co-transf acted with the transfer 
vector NW33.2 and VL941-500i5-gal viral DNA. VL941- 
500/S-gal is a derivative of the E2 strain of AcMNPV in 
which a part of the polyhedrin gene has been 
substituted with a segment of DNA that contains the E. 
coli /8-galactosidase gene (26) . As such, VL941-500/7- 
gal is unable to produce occlusion bodies (polyhedra) 
and is phenotypically occlusion-negative (occ*) . Since 
the virus formed by homologous DNA recombination 
between NW33.2, which contains a functional polyhedrin 
gene, and VL941-500iS-gal is expected to be occlusion- 
positive (occ*) , this trait is used to identify 
potential recombinants. 

VL941-500/J-gal viral DNA is prepared from 
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extracellular virus obtained from S£9 cells infected 
with VL941-500iS-gal virus. Two micrograias of MW33.2 
plasmid DMA and 1 fig of VL941-500i5-gal viral DNA are 
co-transfected into Sf9 cells by the calcium phosphate 
co-precipitation method. Cell supematants are 
collected 5 days after transfection and occ^ viruses 
are isolated by three rounds of placpie purification. 
One occ* plaque, designated 6.2.1, is used to infect 1 
X 10^ Sf9 cells to produce a Passage 1 (PI) virus 
stock. 

The presence of the Bsu-Sse linker in the 
6.2.1 viral DNA is verified by PGR analysis of 
extracellular virus particles using as primers oligo 
32 (see Example 1) and "PVLReverse" , which anneals to 
the viral DNA approximately 320 bp downstream of the 
site of insertion of the Bsu-Sse linker. 

PVLRever se : 5 ' - GGATTTCCTTGAAGAGAGTGAG- 3 ' 
(SEQ ID N0:3) 

Virus is prepared for PGR analysis 
essentially as described by Malitschek and Schartl 

(27) . Four microliters of the PI virus stock is first 
digested for one hour at 55 ""G with 200 ^g/ml pronase 
in a 25 fil reaction containing IX Buffer A (10 mN Tris 

(pH 8.3), 50 mM KCl, 0.1 mg/ml gelatin, 0.45% Nonidef^ 
P40 (Shell Oil Co.), and 0.45% Tween"" 20 (ICI 
Americas) ) • The pronase is then inactivated by 
heating to 95®C for 12 minutes. For PGR the pronase- 
treated virus is mixed with 50 pmol of each of the two 
oligonucleotide primers in a 50 fil reaction containixig 
200 fM dNTPs (this is a mixture of nucleotides that 
contains 200 fM each of dATP, dOTP, dGTP and dTTP) , 
1.5 mM MgGl^, IX Buffer A and 2.5 \inits AmpliTag"" DNA 
polymerase (Perkin-Elmer Getus, Norwalk, GT) . The 
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sample Is sxibjected to 25 cycles of aiaplification, 
each consisting of 1 minute at 94 (denaturation 
step), 1.5 minutes at 55^C (annealing step), and 2.5 
minutes at 72 (extension step). The 25 cycles are 
followed by a 7 minutes extension step at 72 ^C. One- 
fifth of the reaction mix is electrophoresed on a 1.8% 
agarose gel to confirm the presence of the 320 bp 
asiplif ication product. 

Viral DNA is isolated from the 6.2.1 
extracellular virus and further characterized by 
restriction enzyme analysis. Digestion with Eco RI, 
Bsu 361, and Sse 83871 is used to confirm the presence 
of the unique Bsu 361 and Sse 83871 sites in the Eco 
RI "I" fragment of 6.2.1. 

Example 3 

Construction of p74 -Deficient Direct Ligation 
Virus Vector A4000 

A direct ligation virus vector incorporating 
the Bsu- Sse linker (Example 1) into the p74 gene of 
AcMNPV is produced by conventional methods using a 
transfer vector which contains sufficient viral DNA 
sequences to ensure efficient homologous recombination 
with wild type AcMNPV DNA. Unlike the 6.2.1 direct 
ligation virus vector, in which the Bsu- Sse linker is 
incorporated into an imtranslated region of the 
genome, the Bsu-Sse linker in the current example is 
inserted into the viral genome in a manner which is 
intentionally designed to disrupt the integrity of the 
p74 open reading frame. This results in the formation 
of a p74-def ective virus which produces occlusion 
bodies that are not orally infectious (13) • 

Figure 5 shows the general scheme for 
assembling the transfer vector used to construct the 



p74 -deficient direct ligation virus vector. The 
approximately 9.9 kb Bst EII "E" fragment, which 
contains the p74 gene, is isolated from AcMNPV viral 
DNA and inserted into a plasmid vector. An 
approximately 1200 bp Zho I/Hpa I fragment encoding 
amino acids 24-429 of the p74 protein is replaced with 
a polylinker containing the Bsu-Sse linker to produce 
the Ap74-1 transfer vector. The transfer vector Ap74- 
1 is comprised of 4750 bp of 5' flanking sequences, 
residues +1 - +69 of the p74 open reading frame, a 64 
bp polylinker (which includes the 40 bp Bsu-Sse linker 
sequence) , p74 gene sequences from bp +1287 to the 
termination codon at +1937 (see Figure 5; SEQ ID 
N0:4), followed by 1796 bp of 3' flanking secpaences. 
Samples of an E. coli strain HBlOl harboring this 
transfer vector have been deposited by applicants' 
assignee with the Americsm Type Culture Collection, 
12301 Parklawn Drive, Rockville, Maryland 20852, 
U.S.A., and have been assigned ATCC accession number 
68988. 

A recombinant virus containing the Bsu-Sse 
linker inserted into a mutated p74 open reading frame 
is produced by homologous recombination in cultured 
Sf9 cells CO- transf acted with Ap74-1 and wild type 
AcMNPV (strain E2} viral DNA. The desired recombinant 
virus is identified by PCR using one oligonucleotide 
that is specific for the polylinker sequence in the 
deleted gene (location denoted by leftward arrow in 
Figure 5) and a second oligonucleotide that is 
specific for the 3' end of the p74 gene coding region 
(location denoted by rightward arrow in Figure 5) . 
Samples of this virus, which is desicfnated AcMNPV 
strain A4000, have been deposited by applicants' 
assignee with the American Type Culture Collection, 
12301 Parklawn Drive, Rockville, Maryland 20852, 
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U.S.A., and have been assigned ATCC accession number 
VR 2373. 

Example 4 

Construction of Bsp MI -baaed Modular Expression 
Vectors KW44.1 and NW46,50 

To utilize the special design features of 
direct ligation virus vectors, the termini of the DNA 
fragment to be inserted into the viral genome should 
be compatible (i.e., readily ligated by T4 DNA ligase) 
with the termini formed by double digestion of the . 
direct ligation vector with the two xiniquely cutting 
restriction enzymes. Figure 6 displays an example of 
an expression vector design that is intended for use 
with a direct ligation virus vector such as AcMNPV 
strains 6.2.1 or A4000. In this example, the Bsu 361 
and Sse 83871 recognition sites flank the ends of a 
tripartite expression cassette that is conqposed of the 
following modules: (1) a promoter module, which is 
used to regulate gene transcription; (2) a polylinker 
module, which facilitates insertion of the 
heterologous DNA sequences whose expression is 
desired; and (3) a 3' untranslated region (3' UTR) , 
which provides a site for primary trauascript 
processing and polyadenylation. The region bounded by 
the outermost Bsu 361 and Sse 83871 sites is defined 
as the virus insertion module. The internal Sse 83871 
site marked with an asterisk in the polylinker module 
in Figure 6 is destroyed when the Bsp MI site is used 
to insert heterologous gene secpiences into the modular 
expression vector. This internal site is eliminated 
in the pMEV series of vectors described in Exaaple 5. 
Similarly, the internal Bam HI site marked with an 
asterisk in Figure 6 is not required and is also 
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eliminated in the pMEV series of vectors. 

Following insertion of the desired 
heterologous gene sequences into the polylinker 
module, the virus insertion module is excised from the 
plaemid vector by double digestion with Bsu 361 and 
Sse 83871 and inserted by DNA ligation in vitro into a 
Bsu 361/Sse 83871 doxxble cut direct ligation virus 
vector, such as AcMNPV strains 6.2.1 or A4000. 



additional features worth noting. The first is the 
presence of Stu I recognition sites at both ends of 
the tripartite expression cassette. If the 
heterologous gene 8ec[uences inserted into the 
polylinker module do not contain Stu I sites, the 
orientation of the entire expression cassette can be 
reversed in the virus insertion module by digesting 
the plasmid with Stu I and religating the pieces. The 
second feature is designed to facilitate the fusion of 
an exogenous open reading frame (beginning with a 
suitable translation initiation codon, such as ATG) 
with the 3' terminus of a natural or synthetic 5' 
untranslated region (5' UTR) , such that no extraneous 
linker sequences are introduced between the 3' 
terminus of the 5' UTR and the initiation codon. This 
is accomplished by the precise placement of a Bsp MI 
recognition site near the 5' terminus of the 
polylinker module. Bsp MI belongs to a class of Type 
II restriction endonucleases that cuts both strands of 
the DNA duplex at sites which lie outside (and on the 
same side) of its recognition sequence. Moreover, the 
cuts in each strand are staggered in such a way that 
the ends of the fragments have 5' protruding termini 
that can be used as template: primer coaiplexes for a 
DMA polymerase, such as the Klenow fragment of E. coli 



The design depicted in Figure 6 has two 
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DNA polymerase I (23) (SEQ ID N0:5) : 



i -1 

5 ! I ! 1 1 1 1 i 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 

f 

10 Bsp MI digestion 

i 

-1 

NNNNNNNNN NNNNN NMNGCAGGT . 

IIMIIIM llllll 

NNNNKNNNNNNNN NMMNC6TCCA. 



DNA Polymerase I (Klenow fragment) plus dNTPs 

20 -i -1 

NNNNNNNNNKMNN NNNNN NKWGCAGGT 

IIIIMIIIMII iMIIIIIMIill 

NNNNNNNNNNKNN NNNKN NNNCGTCCA 

25 As can be seen from this example, if the Bsp 

MI site is placed in the correct orientation 4 bp 
downstream of the 3' terminus of the 5' UTR 
(underlined nucleotides shown above), one DNA end 
produced by this process corresponds exactly to the 3' 

30 end of the 5' UTR. The 5' UTR is then joined directly 

to a blunt ended- fragment whose sequence begins with 
the AT6 (or other) initiation codon of the desired 
open reading frame. As described below, such 
fragments are easily prepared by PCR techniques. By 

35 convention, the 3' terminus of the 5' UTR sequence is 

designated as position -1 and is used as the landmark 
for all numerical references in the virus insertion 
module. In all of the vectors described herein, the 
5' UTR is incosrporated as part of the promoter module 

40 and has the complete nucleotide sequence of the 5' UTR 

naturally associated with the promoter in that module. 

The scheme for constructing Bsp MI -based 
modular expression vectors containing the AcNNPV 6.9K 
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gene promoter and 3' UTR is illustrated in Figure 7. 
The 6.9K gene is a "late" gene that encodes a small 
arginine-rich DNA-binding protein used for packaging 
viral DNA into nucleocapsids (28,29). The 5' and 3' 
non-coding sequences flanking the 6.9K open reading 
frame are isolated by PCR amplification using the 
AcMNPV Hind III "E" fragment as template. The 
oligonucleotide primers used for the amplification 
reactions are each composed of two fxinctional regions. 
The 5' portion of each oligonucleotide is not 
homologous to the ^AcMNPV template and is used to 
incorporate specific restriction sites into the final \ 
PCR product. The 3' portion of each oligonucleotide 
is homologous to sequences in the Ac21NPV genome that 
define one of the termini of the amplified region. 

The oligonucleotides used for PCR 
amplification of the module containing the 6.9K gene 
promoter and 5' UTR are NW oligo 1 (SEQ ID NO: 6), 
which anneals to the (-) strand approximately 200 bp 
upstream of the 6.9R translation initiation codon and 
contains Xho I, Bsu 361, and Stu I recognition sites, 
and PD oligo 23 (SEQ ID N0:7), which anneals to the 
(•f) strand ixmnediately upstream of the 6.9K translation 
initiation codon and contains Bam HI and Bsp MI 
recognition sites. The sequences of these primers and 
of the PCR amplification product (SEQ ID NO: 8) are - ^ 
presented in Figure 8. 

The oligonucleotides used for PCR 
anplification of the 6.9R gene 3' UTR are NW oligo 2 
(SEQ ID NO: 9), which anneals to the (-) strand 
immediately downstream from the translation stop codon 
of the 6 • 9K open reading frame and contains 
recognition sites for Xba I, Eco RI, Nco I, Bam HI, 
Sma I and Kpn I, and NW oligo 3 (SEQ ID NO: 10), which 
anneals to the (-►) strand approximately 200 bp 
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downstream of the translation stop codon and contains 
recognition sites for Stu I, Sse 83871 and Sst I. The 
sequences of these primers and of the PGR 
amplification product (SSQ ID NO: 11) are presented in 
Figure 9« 

The amplification reactions for the 6.9 K 
promoter module and 3' UTR module are conducted in 
separate GeneAmp tubes (Perkin-Elmer Cetus, Norwalk, 
CT) according to the following procedure. Fifty 
picomoles of the appropriate primer pair (i.e., 50 
pxaol of each oligonucleotide) are combined with 250 pg 
of AcMNPV Hind III fragment "H" DNA (see Figure 1) in 
a 50 fil reaction mixture containing 200 dNTPs, 1.5 
sAf MgCl^/ IX Buffer H and 2.5 \mlts AmpliTag'^ DNA 
polymerase (Perkin-Elmer Cetus, Norwalk. CT) . The 
samples are first subjected to 5 rounds of 
anplification consisting of 1 minute at 94 
(denaturation step), 1.5 minutes at 45 ""C (annealing 
step), and 2.5 minutes at 72*»C (extension step). This 
is followed by 20 cycles consisting of 1 minute at 
94®C (denaturation step), 1.5 minutes at 60**C 
(annealing step), and 2.5 minutes at 72 *C (extension 
step) . The last extension step is extended an 
additional 7 minutes. 

The amplification products are extracted 
once with chloroform, once with phenol : chloroform, and 
then precipitated with ethanol. The fragment 
containing the presumptive promoter module is digested^ 
with Zho I and Bam HI (see Figure 8) and the fragment 
containing the presingptive 3' UTR module is digested 
with Xba I and Sst I (see Figure 9) . Each of the 
desired fragments is then isolated by electrophoresis 
on a 1.8% low melt agarose gel. The promoter fragment 
is inserted into the polylinker of Bluescript SK* 
(Stratagene, La Jolla, CA) between the Xho I and Bam 
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HI Bites, and the 3' UTR fragment is inserted into the 
polylinker of a separate Bluescript SR^ plasmid between 
the Xba I and Sst I sites (see Figure 7) . The plasmid 
identified as containing the 6.9K promoter module is 
5 designated NW39.2, while that containing the 6.9K 3' 

UTR is designated NW41.5. Both NW39.2 and NW41.5 are 
sequenced to verify the integrity of the 6.9K gene 
segments and flanking linker sequences. 

To construct a complete Bsp Ml-based modular 

10 expression vector, MW39.2 and NW41,5 are digested with 

Xba I and Sst I and the fragments are resolved by 
electrophoresis on a 1.2% low melt agarose gel. The . . 
3.1 kb fragment derived from MW39.2 and the 200 bp 
fragment derived from NW41.5 are extracted from the 

15 gel and ligated together • A clone containing the 

desired promoter, polylinker and 3' UTR modules is 
identified by restriction enzyme analysis and is 
designated NW44.1. 

To obtain a vector in which the ea^ression 

20 cassette has the opposite orientation within the virus 

insertion module, NW44.1 is first digested with Stii I. 
The 2.9 kb Stu I fragment is purified by gel 
electrophoresis, dephosphorylated to prevent self- 
ligation, and re-ligated to the 450 bp NW44*1 Stu I 

25 fragment containing the expression cassette. A clone 

containing the Stu I insert in the opposite 
orientation relative to KW44.1 is identified by 
restriction enzyme analysis and designated NW46.50. 

30 TgyaTOpl e 5 

Construction of Esp 3 I -based Modular Expression 
Vectors pMEVli pMgV2, pMEV3 and pMEV4 



The modular expression vectors pMEVl, pMEV2, 
35 pMEV3 and pMEV4 are constructed from NW46.50 by 



- 43 - 



substituting the promoter- containing Pst I/Xba I 
fragment of 2m46.50 with Pst I/Xba I-digested 
fragments containing the AcMNPV DA26 (pMEVl) , 6.9K 
(pMEV2) , polyhedrin (pMEV3) and 35K (pHEV4) viral gene 
promoters. The DA26 and 35K genes are expressed at €ui 
early stage in the life cycle of the virus (i.e., 
before the onset of DNA synthesis) (30,31) • As noted 
earlier, the 6.9K gene encodes a "late" class 
structural protein, which is expreaBed after the onset 
of DNA synthesis (28) . The polyhedrin gene belongs to 
the class of genes that are expressed "very late" in 
the virus life cycle and encodes the major structural, 
component of the viral occlusion bodies. 

As depicted in Figure 10, the design of the 
Esp 3I-based vectors is a refinement of the Bsp MI- 
based model, in which (1) the redundant Bam HI and See 
83871 sites in the polylinker module are eliminated, 
and (2) the Bsp MI recognition site is replaced by an 
Esp 31 site. Esp 31 belongs to the same general class 
of Type II restriction endonuclease as Bsp MI, in that 
it cuts outside of its recognition sequence and 
produces 5' protruding termini that can be filled in 
by DNA polymerase. Experience indicates, however, 
that Esp 31 has a more robust activity than Bsp MI euid 
is the preferred enzyme when all other factors perzoit 
its use (e.g., when there are no Esp 31 sites in 
either the promoter module or the 3' UTR module). To 
use Esp 31 in the manner illustrated earlier for Bsp 
MI, its recognition site must be placed in the correct 
orientation 1 bp downstresun of the 3' end of the 5' 
UTR. 

As described in Example 4 for the Bsp MI- 
based vectors, the promoter fragments used in 
constructing the Esp 3 I -based vectors are formed by 
PGR amplification of cloned viral DNA using promoter- 
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specific pairs of oligonucleotide primers. The 
primers are designed so that the amplified promoter 
segments have the following general structure: (1) a 
5' terminal 22 bp heteropolymeric synthetic secpience 
with recognition sites for restriction endonucleases 
Sst I, Sse 83871 and Stu I (in that order); (2) a 
segment of viral DNA that extends from a point 100-350 
bp upstream of the predominant transcriptional start 
site of the gene to the 3' terminus of the 5' UTR 
(i.e., position --l with respect to the trcmslation 
initiation codon) ; and (3) a 3' terminal 23 bp 
heteropolymeric region with recognition sites f or . . 
restriction endonucleases Esp 31 and Xba I (in that 
order) . The location and orientation of the Esp 31 
recognition site places the cleavage sites between 
positions -5 and -4 in the (+)str€md and between 
positions -1 and +1 in the (-) strand. 

The template used to prepare each promoter, 
the sequences of the primers and the sequences of the 
anqplifled PGR products are shown in Figures 11 (DA26 
promoter module) (SEQ ID NOS:12-14), 12 (3SK gene 
promoter module) (SEQ ID NOS:15-17), 13 (6.9K gene 
promoter module) (SEQ ID KOS: 18-20) and 14 (polyhedrin 
gene promoter module) (SEQ ID NOS:21-23). 

For each axnplif ication reaction, 50 pmol of 
the appropriate primer pair are mixed with 250 pg of 
template DNA in a 50 fil reaction mixture containing 10 
sM Tris-HCl (pH 8.3), 50 vM KCl, 1.5 mM I^l,, 200 /M 
dNTPs, 100 /xg/ml gelatin and 2.5 \uiits AmpllTaq™ DNA 
polymerase (Perkin-Elmer Cetus, Norwalk. CT) . For the 
DA26, 6.9K and polyhedrin promoter modules, the 
samples are then sxibjected to 2 rounds of 
amplification consisting of 1 minute at 94 
(denaturation step), 1.5 minutes at 40^C (annealing 
step), and 2.5 minutes at 12^C (extension step). This 
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ie followed by 15 cycles of 1 sdnute at 94 ®C 
(denaturation step), 1.5 minutes at 60*'C (annealing 
step), and 2.5 minutes at 72 (extension step). The 
last extension step is programmed to run an additional 
7 minutes. For the 35K promoter module, the sample is 
asplified through 25 cycles of 1 minute at 94 ?C 
(denaturation step), 1.5 minutes at 55^C (annealing 
step), and 3.0 minutes at 72 (extension step). As 
in other reactions, 7 minutes is added to the last 
extension step. 

Each reaction is terminated by the addition 
of EDTA to 10 mM and Sarkosyl (sodixm N- 
lauroylsarcosine) to 0.2% (w/v) . The products are 
then extracted once with chloroform, once with 
phenol : chloroform and precipitated with ethanol. The 
DMA samples are redissolved in an appropriate buffer 
and then digested with Pst I (which recognizes the 
central six basepairs [CTOCAiGl of the Sse 83871 site) 
and Xba I . Each presumptive promoter fragment is then 
purified by gel electrophoresis on a 1.2% low melt 
agarose gel and ligated to a 3.2 kb Pst I/Xba I vector 
fragment prepared from MW46.50 (see Figure 7). This 
fragment contains the polylinker module, 3' UTR module 
and Bluescript SK-i- framework of NW46.50. The desired 
recombinamts are identified by restriction enzyme 
amalysis and DNA sequence determination. 
Representative isolates of each eaepression vector are 
designated pMEVl.l for the DA26 promoter, XMBV2.1 for 
the 6.9R gene promoter, ptSEVZ.l for the polyhedrin 
gene promoter and pMEV4.1 for the 35K gene prcnnoter. 
Samples of cui E. coli strain DH5q; harboring plasmid 
pMEVl.l (AC0064.1) have been deposited by applicants 
with the American Type Culture Collection, 12301 
Parklawn Drive, Rockville, Maryland 20852, U.S.A., on 
April 7, 1993, and have been assigned ATCC accession 
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xiusiber 69275. 

Example 6 

Construction of Ebp 3 I "based Modular Expression 
5 Vectors Containing the Drosophila hsp70 Gene 

In some cases, it may be advantageous to use 
an insect cell promoter rather than a viral promoter 
to direct transcription of a foreign gene in a 

10 baculovirus -based expression system. In particular, 

Morris and Miller (32) report that a Drosophila hsp70 
(major heat shock) gene promoter functions at a 
comparable or better level than the AcMNPV ETL (early) 
gene promoter in directing expression of a 

15 chloramphenicol acetyltransf erase (CAT) reporter gene 

in a variety of insect cell lines. To construct an 
expression vector that uses an insect cell promoter to 
control foreign gene expression and that can be used 
with the direct ligation virus vectors, a DMA segment 

20 of Drosophila ft^r DNA containing the hspTO 

promoter and 5' XJTR is anqplified by PCR and 
stibstituted into one of the modular expression vectors 
described above. The sequences of the primers for 
this reaction and the predicted sequence of the 

25 amplified fragment, which contains an Esp 31 site in 

the presuscptive polyllnker region, are presented in 
Figure 15 (SEQ ID KO:24) . The procedures for PCR 
aaqplification of the hsp70 promoter module, and for 
Inserting this module into an NW46. 50 -based ea^resslon 

30 vector are as described in Example 5 for the AcMNPV 

35K promoter module. A resulting clone with the 
desired structure is designated pMEVS. 
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Construction of Esp 3I-based Modular Expression 
Vectors Containing an Alternative 3^ TTTR 
and the hrS enhancer 

AcMNPV contains five regions of homologous 
DNA sequence, designated hr l to hr5, that are widely 
interspersed along its genome (22,33) • Each region is 
500-800 bp in length and contains variations of 
several repeated sequence motifs, one of which (IR24) 
(34) contains an Eco RI recognition site. Functional 
studies have shown that regions such as hr5 are 
complex cis-acting regulatory domains that can enhance 
the transcriptional activity of at least some linked 
early viral genes (e.g., the 35K gene) by as much as 
300- to 1000-fold (35) . In addition, recent evidence 
suggests that the ^ elements may also serve as 
origins for DNA synthesis (36) . 

The hrS element lies downstream and 
immediately adjacent to the AcMNPV 35K gene, and is 
therefore well suited for use as an alternative 3' UTR 
module. Fig\ire 16 displays the sequences of two 
oligonucleotides (SEQ ID NOSt25,26) for the PCR 
amplification of a segment of the AcMNPV genome that 
begins just upstream of the 3' terminus of the 35K 
gene and extends through all six IR24 repeats (marked 
by the Eco RI sites) of hrS. The conditions used for 
PCR axQ)lification of the hrS domain are the same as 
those described for the amplification of the 35R 
promoter module in Example 5. After purification, the 
PCR product (SEQ ID NO: 27) is digested with Bam HI and 
Xho I and the presusrptive hrS enhancer module is 
isolated on a 1% low melt agarose gel. This module is 
then ligated with gel purified Bam Hl/Xho I vector 
fragments prepared from each of the Esp 3I-based 
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modular expression vectors described in Exanqples 5 and 
6. The result is a new series of vectors in which the 
6.9K gene -derived 3' UTR module is replaced by the hr 5 
module. Plasmids with the desired structures are 
identified by restriction enzyme analysis and are 
designated pNEVlA (with the DA26 gene promoter 
module), pMEV2A (with the 6.9K gene promoter module), 
pMEV3A (with the polyhedrin gene promoter module) , 
pMEV4A (with the 35K gene promoter module) and pMEVSA 
(with the DrosoDhila melanoaaster hsp70 promoter 
module) • 

Example 8 

Construction of Modular Expression Vectors 
Containing Codon-Qptimized or Native 
Sequence AalT Genes 

One application in which the direct ligation 
technology and the vectors described in Examples 4-7 
are particularly useful is the design and optimization 
of recombinant viral insecticides. These are viruses, 
especially baculoviruses, whose insecticidal 
properties have been enhanced by the addition of one 
or more foreign genes that encode insect -specific 
toxins (14/15,37); peptide hormones (38) or enzymes 
(21) . 

The peptide AalT, which is found in the 
venom of the North African scorpion Androc tonus 
australis, is an exaiiqple of such an insect-specific 
toxin (15) » When AalT is injected into the body 
cavity of an insect larva, it binds selectively to 
voltage-sensitive sodium channels and causes a 
contractile paralysis. Chronic administration of the 
toxin, which can be achieved by infecting insect 
larvae with AalT -producing viruses, is associated with 
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a prolonged state o£ paralysis and eventual death. 

Transfer vectors are constructed for the 
insertion of AalT genes Into the polyhedrln gene 
region of AcMNPV, These transfer vectors are 
derivatives of the published pVL1393 (4) and pVL985 
(26) vectors. In one vector, the AalT coding region 
is inserted between the Bam HI and Eco RI sites of 
pVL1393 and has the same nucleotide sequence (SEQ ID 
NO: 45) as the coding region of the AaHITl cDNA 
described by Bougis et al. (39), and is used in 
conjunction with the native nucleotide sequence 
encoding the native AalT signal peptide (SEQ ID 
NO;28) • In seven other vectors, the AalT gene is 
inserted into the Bam HI site of pVL985 and consists 
of a codon-optlmlzed nucleotide sequence (SEQ ID 
NO: 29, nucleotides 49-258) for the mature AalT toxin, 
which Is linked In each vector to a separate codon- 
optimized sequence for one of seven different Insect 
signal peptides identified below (SEQ ID NOS:29 
(nucleotides 1-48) ,32,34,36,38,41,43) . One such 
construct, containing the codon-optimlzed AalT gene 
linked to the signal peptide of a Drosoohila cuticle 
gene, is designated pACOOSS.l. Figure 17A depicts the 
region of pACOOSS.l which contains the cuticle signal 
and AalT gene inserted into the unique Bam RI site 
located between residues +34 and +177 of the 
polyhedrln gene. The ATT sequence at +1 indicates the 
mutated translation start codon in the parental pVL985 
vector. Sauries of an E. coll strain HBlOl harboring 
the transfer vector pAC0055.1 have been deposited by 
applicants with the American Type Culture Collection, 
12301 Parklawn Drive, Rockville, Maryland 20852, 
U.S.A., and have been assigned ATCC accession number 
S9166. In an alternative embodiment, the native, 
rather than codon optimized, nucleotide sequences 
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encoding the seven different insect signal peptides 
are used (SEQ ID NOS:33, 35,37,39,40,42,44) . 

The toxin coding segments of these transfer 
vectors are recovered by PGR for subsequent insertion 
into modular expression vectors. The PGR strategy 
(Figure 17A) smd the sequence of the amplified 
fragment (Figure 17B) are exes^lified for the 
Cuticle/AalT gene. The ( + ) strand primer used for each 
reaction is an oligonucleotide of 25-27 bases whose 5' 
terminus coincides with the AT6 translation initiation 
codon of the gene to be amplified. The specific 
sequences of the (-»-) strand primers used to suaplify . 
each AalT gene are listed below, followed by 
identification of the insect species signal peptide 
which is the source of the primer used: 

5'-ATG AAG TAG GTG 666 GTG 66G CTG ATG-3' 
(esterase- 6 signal from Drosophila melanoaaster ; SEQ 
ID N0:41, nucleotides 1-27) 

5'-ATG TAG AAA GTG AGG GTG TTG GTG ATG-3' 
(adipokinetic hormone signal from M^'^'^V^^ sexta ; SEQ 
ID N0:34, nucleotides 1-27) 

5'-ATG TTG AAG TTG GTG ATG ATG TGG 6GG-3' 
(cuticle signal from Drosophila melanoaaster ; SEQ ID 
MrO:29, nucleotides 1-27) 

5' -ATG GGG 6GT AAA TTG 6TG 6TG GTT GT6-3' 
(apolipophorin signal from Manduca sexta ; SEQ ID 
NO: 36, nucleotides 1-27) 

5' -ATG AAA GTG GTG GTG GTG TTG GGG ATG-3' 
(pBMHPG-12 signal from Boaibyx morl; SEQ ID NO: 32, 
nucleotides 1-27) 

5' -ATG GGC 6TC GTG GTG GTG TTG GGG TGG- 3' 
(sex specific signal from Bombyx mori; SEQ ID NO: 43, 
nucleotides 1-27) 

5' -ATG TTG AGG TTG GGT ATT GTG GTG TTG- 3' 
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(chorion signal from Bonbvx mori; SEQ ID NO: 38, 
nucleotides 1-27, except that nucleotide 25 in the 
primer is a T instead of a C) 

5'-ATG AAA TTT CTC CTA TTG TTT CTC G-3' 
(native signal for AalT; SEQ ID NO: 28, nucleotides 1- 
25) 

The first seven primers are used with a 
codon optimized gene encoding AalT (SEQ ID NO: 29, 
nucleotides 49-258); the last primer is used with the 
native gene encoding AalT (SEQ ID NO: 45) . 

The (") strand primer in each case is the . , , 
PVLReverse primer (SEQ ID N0:3) (see Example 2), which 
hybridizes to a common site located about 35-40 bp 
downstream of the 3' terminus of the AalT gene (i.e., 
in the polyhedrin gene) • The conditions for the PGR 
reaction are essentially as described for the 35K 
promoter module in Example 5. After purification the 
reaction products are treated with the Klenow fragment 
of E. coli DNA polymerase I in the presence of all 
four dNTPs to ensure that the PGR products have blimt 
5^ termini* The 3' terminus of each toxin coding 
fragment is then defined by digesting the PGR products 
with either Bam HI (for the codon-optimized AalT 
genes) or Eco RI (for the native sequence AalT gene) , 
and the fragments are purified by electrophoresis on a 
1.8% low melt agarose gel. Figure 17B depicts the 
complete nucleotide sequence of the PGR amplified 
codon optimized cutide/AalT coding region (SEQ ID 
NO:29, nucleotides 1-258). 

To prepare the Esp 3 I -based modular 
eacpression vectors for toxin gene insertion, each 
vector is digested with Esp 31 and the resulting 5' 
protruding termini are filled in by the action of E. 
coli DNA polymerase I (Klenow fragment) in the 
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presence of all four dMTPs. Part of each preparation 
is then digested with Bam EI (for insertion of the 
codon- optimized gene fragments) and part is digested 
with Eco RI (for the native sequence gene fragment) • 
The vector is separated from the liberated polylinker 
fragment by electrophoresis on a 1% low melt agarose 
gel and then ligated in separate reactions to the 
appropriate AalT- encoding gene fragments. A schematic 
representation and the complete nucleotide sequence of 
the virus insertion module designated AC0076.1 (SEQ ID 
NO: 30) formed by inserting the Cuticle/AalT coding 
region into pMEVl (which contains the AcMNPV DA26 
promoter) is presented in Figure 18. 

Sample 9 
Preparation of 6>2.1 A4onn 
Viral DNAs for Ligation 

To prepare the 6.2.1 and A4000 viral DMAs 
for gene insertion by ligation in vitro, the DMAs are 
linearized by sequential digestions with Sse 83871 and 
Bsu 361, and then separated from the szaall Bsu-Sse 
linker fragment by gel filtration chromatography. In 
a typical preparation, forty micrograms of 6.2.1 or 
A4000 viral DNA are digested for 2 hours at SVC with 
100 units of Sse 83B7I (Takara Biochemical, Inc., 
Berkeley, CA) in a 250 /xl reaction containing 10 soM 
Tris pH 7.5, 10 nM MgCl,, 1 m dithiothreitol (DTT) , 50 
jM MaCl, and 0.01% BSA. The reaction mixture is then 
adjusted to 100 mM NaCl and 50 mM Tris HCl, pH 7.9, 
and the DNA is digested for 2 hours at 37 with 100 
units of Bsu 361 (New England Biolabs, Beverly, MA) • 
The reaction is then terminated by adding SDS to a 
final concentration of 1% (w/v) , NaCl to a final 
concentration of 0.3 M and EDTA to a concentration of 
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10 nM. Thereafter, the DNA is chroma tographed on a 
"poly-prep" coluann (BioRad Laboratories, Richmond, CA) 
containing a 2 ml bed volume of Sephacryl-300 
(Pharmacia, Piscataway, NJ) equilibrated with 10 mM 
Tris-HCl (pH 8.0), 1 EDTA, 0.1% SDS, and 0.3 M 
NaCl. Twelve 150 ^1 fractions are collected. Ten 
microliters of each fraction are anialyzed by gel 
electrophoresis to identify fractions containing the 
viral DNA. These fractions are pooled, extracted once 
with phenol : chloroform, and the viral DNA is then 
precipitated with etheuiol. The DNA is resuspended in 
TE (10 mM Tris-HCl (pH 8.0), 1 mM EDTA) at a 
concentration of 0.2-1 fig/iil and stored at 4^C. To 
determine if the viral DNA has been linearized 
conpletely, an aliquot is digested with Eco RZ and 
analyzed by gel electrophoresis. Viral DNA exhibiting 
a 7 kb Eco RI fragment has not been digested 
completely with Bsu 361 and Sse 83871 and is not used. 

Example 10 

Insertion Of Foreign DNA Into The Direct Ligation 
Vims Vector 6.2.1 Bv Ligation In Vitro 

The efficiency of obtaining recombinant 
viruses by ligating foreign DNA in vitro into the 
unique Bsu 36I/S8e 83871 cloning site of linearized 
6.2.1 viral DNA is demonstrated with the 2.9 kb Bsu 
36I/S8e 83871 fragment of NW44.1, which consists 
mainly of DNA sequences from the Bluescript cloning 
vector (Figure 7) . 

This fragment is purified by digesting 
plasmid NW44.1 sequentially with Sse 83871 and Bsu 361 
cuid then separating the digestion products on a 1% low 
melt agarose gel containing 40 mM Tris-acetic acid (pE 
7.8), 1 mM EDTA and 0.5 ng/ml ethidium bromide. Aft^r 
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electrophoresis, a gel slice containing the 2.9 kb Bsu 
36I/S6e 83871 vector fragment is carefully excised. 
To extract the DNA from the gel, the slice is diluted 
with 3 volimes of a buffer containing 20 mM Tris-HCl 
(pH 7.5), 0.4 M sodium acetate and 1 vM EDTA. The 
mixture is heated to 65*'C until the gel slice is 
melted, then cooled to 37*^0 and extracted with an 
ecpial volume of water -saturated phenol (equilibrated 
to room temperature) . After extraction, the phases 
are separated in a microfuge (15,000 rpm for 3 minutes 
at room temperature) and the phenolic phase is 
removed. The aqueous phase and interface material are. 
re-extracted with water -saturated phenol until little 
or no precipitate remains at the interface. The 
aqueous phase is then removed and the 2.9 kb DKA 
fragment is concentrated by ethanol precipitation. 

One-half microgram of Bsu 361/Sse 83871 
linearized 6.2.1 viral DNA (see Example 9) is mixed 
with approximately 12 ng of the Bsu 361/Sse 8387Z 
fragment of MW44.1 in a 5 fil reaction mixture 
containing 25 mM Tris-HCl (pH 7.6), 5 xdM MgCl,, 1 
ATP, 1 mM DTT, 5% (w/v) polyethylene glycol-8000, and 
0.5 units T4 DNA ligase (Gibco-BRL, Gaithersburg, MD) . 
After an overnight inc\ibation at 16 the entire 
ligation reaction is used to transfect Sf9 cells. 

For transf action, 1.5 x 10^ Sf9 cells are 
plated in one well of a 6 -well cluster dish. After 
the cells have attached, the cell culture medium is 
replaced with 0.375 ml Grace's Insect cell culture '-'^ 
medium (40) . The contents of the ligation reaction 
are mixed with 0.375 ml of transf action buffer (25 mM 
HEPES (pH 7.1), 140 itiM NaCl, 125 mM CaClj) and then 
added dropwise to the plated cells. A separate well 
is similarly treated with one half microgram of 
linearized (unligated) 6.2.1 viral DNA to provide a 
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negative control. The cells are Incubated with the 
DMA for 4 hours at 27^0, washed once with 2 ml of 
Grace's Insect medium supplemented with 0*33% (w/v) 
lactalbumin hydrolysate, 0.33% (w/v) TC yeastolate, 
0.1% (v/v) Pluronic™ F-68 (Gibco BRL, Gaithersburg, 
MD) and 10% (v/v) fetal bovine serum (complete TMM-FH 
medium), and then incubated for 2 hours at 27 with 2 
ml of complete TMM-FH. The cells are then harvested 
and one- tenth of the total culture (approximately 
150,000 transfected cells) is mixed with 2 x 10^ 
xintreated Sf9 cells. The mixture is plated on a 6 cm 
tissue culture dish and the attached cells are 
carefully overlaid with 4 ml complete TNM-FH medium 
supplemented with 1.5% SeaPlague™ agarose (FMC 
BioProducts, Rockland, MB) , 100 units/ml penicillin G 
and 100 fig/ml streptomycin sulfate. Since the cells 
are harvested 6 hours after transf action (i.e., before 
the production of extracellular virus) , viral plaqpies 
are produced only by those cells which have taken up 
infectious viral DNA. Hence, the number of plagues on 
each plate provides a direct measure of the efficiency 
of the transf action event. 

Five days after plating, the dishes are 
scanned for the presence of occ*^ plaques. For one 
esqperiment, fifteen plagues are observed on a plate 
containing linearized and unligated 6.2.1 viral DMA 
(the negative control) • Sixty-nine plaq:ues are 
observed on the plate containing 6.2.1 viral DNA 
ligated to the MW44.1 Bsu 361/Sse 83871 fragment. 
Eighteen of these plaqpies are picked at random for 
further analysis. 

The plagues are transferred to individual 
wells of a 48-well cluster dish containing 7.5 x 10^ 
Sf9 cells and 0.5 ml complete TNM-FH media. After 5 
days, the extracellular virus is harvested from the 
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wells and analyzed by PGR (see Example 2) for the 
presence o£ the NW44.1 -derived Bluescript sequences in 
the viral genome. The primers used £or PGR are 
PVLReverse (see Figure 4), which anneals to the viral 
5 DNA. approximately 320 bp downstream of the site of 

insertion of the Bsu-Sse linker in 6.2.1 (Example 2), 
and Bluescript "sequencing primer", which corresponds 
to the sequence on Bluescript SR+ DNA approximately 
100 bp upstream of the insertion of the Bsu-Sse 
10 linker: 

Bluescript sequencing primer: 5' -CCAT6ATTAC6CCAA6C6C6-:3' 
(SEQ ID NO: 31) 

With this primer set, a recoiobinant virus 

15 containing the NW44.1- derived Bluescript sequences 

yields a PGR product of a^roximately 400 bp in 
length, whereas no specific PGR products are formed 
with non- recombinant 6.2.1 viral DNA. The conditions 
for PGR are as described in Exasrple 2. One-fifth of 

20 the PGR reaction is analyzed on a 1.8% agarose gel. 

All of the test samples contain the predicted 400 bp 
axnplif ication product, indicating that each of the 
eighteen rauidomly picked viruses contains the desired 
insert. This result not only demonstrates the 

25 feasibility of the direct ligation approach, but also 

shows that the efficiency of recombinant virus 
recovery is very high. 

Example 11 

30 Construction of Biologically Active Recombinant 

AalT-Expressina Viruses by Direct Ligation 



35 



To demonstrate that the direct ligation 
approach can be used to produce biologically active 
recombinant viruses, Bsu 361/Sse 87831 delineated 
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virus insertion modules are isolated from six AalT- 
containing modular esqpression vectors: pMEVl/codon 
optimized cuticle signal and codon optimized AaZT gene 
(Cuticle-AalT) , pMBV2/Cuticle-AaIT, pMEV3/Cuticle- 
AalT, pMEVl/native AalT signal and gene (AaIT-cD29A) , 
pMEV2/AaIT-cDNA and pMEV3/AaIT-cDNA. An aliquot of 
each module is ligated with an appropriate amount of 
Bsu 361/Sse 83871 linearized and purified AcMNPV A4000 
DNA and transf acted into Sf9 cells, as described in 
Example 10. In a separate experiment, two of the 
virus insertion modules (from pMEVl/Cuticle-AalT and 
pMEVl/AalT-cDNA) are ligated with linearized AcMNPV 
6.2.1 viral DNA instead of A4000 viral DNA and 
transf acted into Sf9 cells. 

Linearized A4000 and 6.2.1 viral DNAs 
treated with DNA ligase in the absence of a virus 
insertion module are used as negative controls for the 
transf action. Five days after transf action, the 
medium is removed from the transf acted Sf9 cells. 
Ten- fold serial dilutions of each transf ection 
supernatant are prepared and two 1 ml aliguots of the 
10■^ 10"* and 10"* dilutions are used to infect 1.5 x 
10^ cells in each of six 60 mm culture dishes. One 
hour after addition of virus, the virus inocula are 
removed and the cells are overlaid with agarose - 
containing medium, as described in the previous 
example. The titer of each transf ection supernatant 
is calculated by counting viral plagues, and 12 or 
more plaques are picked at random from the 10"* or lO*' 
dilutions. Each plaque is then screened by PCR for 
the presence of the desired insert using the general 
procedure described in Example 2. In each case, the 
primer set is specific for the desired recombinant 
vianxs. The virus titers of the transf ection " 
supematants (5 days post-transf ection) and the 
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frequency of recombinant virus recovery is sxunmarized 
below (pfu are plaque forming iinits) : 



Summary of Recombinant Virus Formation By Direct Ligation 


Derivatives of AcMNPV A4000 


Virus Insertion Module 


Tiler ^fu/ml) 


No. PGR Positive/No. Tested 


None 


0.71 X 10* 




pMEVl/Cuticle-AalT 


1.4 X 10* 


2/12 


pMEV2/Cuticle-AaIT 


1.6 X 10* 


5/16 • 


pMEVS/Cutide-AalT 


1.8 X 10* 


5/16 


pMEVl/AaTT-cDNA 


1.4 X 10* 


4/12 


pMEV2/AaIT-cDNA 


0.81 X 10* 


0/40 


pMEV3/AaIT-cDNA 


1.4 X 10* 


3/12 


Derivatives of AcMNPV 6.2.1 , J 


None 


1.1 X 10* 




pMEVl/CutideAalT 


1.7 X 10* 


1/12 


pMEVl/AalT-cDNA 


1.3 X 10* 


.. 3/12. . . 



One virus (A4000 containing the pMEV2/JlaIT 
20 cDMA module) is not recovered on this initial 

screening. This virus appears to replicate poorly cmd 
is recovered from a primary screening of a 
transf action supernatant isolated 2 days, rather them 
5 days, after DNA addition. The frequency with which 
25 recombinant viruses are recovered from the other " 

transf action supematants ranges from 8%- 3 3%. 

One to three recombinants identified on the 
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first round of screening for each virus are then 
subjected to one or two additional rounds of plague 
purification, and PI stocks are prepared as described 
in Example 2. Since the AcMNPV A4000 derivatives do 
not form orally infectious occlusion bodies, their 
biological activity is assessed by injecting 6,000- 
10,000 pfu of extracellular virus into the body cavity 
of fourth ins tar Heliothis virescens (tobacco budworm) 
larvae. The virus is titered by the plague assay 
method, and then diluted to 1.2-2 x 10"' pfu/ml in TNM- 
FH medium supplemented with 0.5% (v/v) red dye number 
5. Each larva is anesthetized with carbon dioxide. for 
2-5 minutes and then injected with 0.5 /il of diluted 
virus, using a Hamilton syringe eguipped with a 26 
gauge needle. The needle is inserted longitudinally 
between the last two prolegs and then moved anteriorly 
two to three body segments prior to injection. 
Following injection, each larva is inspected for the 
release of dye-stained hemolymph and discarded if 
sasKple loss is evident or suspected. The larvae are 
then maintained at 27**C in covered 4 cm^ diet cells 
(one larva per cell) and inspected visually 1-4 times 
a day for evidence of morbidity or mortality. An 
infected larva is scored as responding to the 
treatment if it is either dead or moribund. A 
moribund individual is one which is unable to right 
itself within 0.5-2 minutes after being turned on its 
back. 

For the two 6.2.1 derivatives, which do' 
produce orally infectious occlusion bodies, biological 
activity is assessed on third instar Heliothis 
virescens larvae following oral administration of 1000 
occlusion bodies on a small leaf disk. A one 71 1 
droplet containing the virus in TET buffer (50 mM 
Tris-HCl, pH 7.5/10 mM EDTA/0.1% Triton™ X-100 (Rohm 
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and Haas, Philadelphia, PA)) is added to a 5 mm 
diameter cotton leaf disk in an individual well 
containing a premoistened filter paper disk 
(approximately 4 mm in diameter - 30 ill water) . After 
the droplet dries, a single larva is placed in the 
well and the well is closed. The larvae are allowed 
to feed overnight. The next mozi:iing, larvae which 
consijme the entire leaf disk are transferred to 
individual wells containing insect diet. The larvae 
are monitored for mortality until survivors pupate. 
This procedure is also performed with a wild- type E2 
AcMNPV viral strain, which is used as a control. 

In both bioassays, greater than 95% of all 
responding larvae infected with the recombinant virus 
containing the AalT gene exhibit contractile paralysis 
prior to death. Moreover, as sxnmnarized below, all of 
the recombinant AalT viruses have a shorter mean 
response time (RT50) than wild type AcMNPV (strain E2) . 
If more than one virus isolate is tested in each virus 
group, a separate RT50 value is reported for each 
isolate . 



wo 94/28114 



PCTAJSM/06079 



- 61 - 



Bioassay Summary: Recombinant AalT Viruses 
on H, vtresctns 


Derivatives of AcMNPV A4000 


AalTgene 


Expression 
Vector 


Promoter 


RT50 
% of wild type 


Cuticle-AalT 


pMEVl 


DA26 


• 50± 4 
52± 5 


Cuticle-AalT 


pMEV2 


6.9K 


51± 2 
51± 3 


Cuiide-AalT 


pMEV3 


Polyfaedrin 


53 
64± 11 


AaTT-cDNA 


pMEVl 


DA26 


69± 3 


AalT-cDNA 


pMEV2 


6.9K 


51± 3 


AalT-cDNA 


pMEV3 


Polyhedrin 


57± 2 


Derivatives of AcMNPV 6.2.1 


Cutide-AaTT 


pMEVl 


DA26 




AalT-cDNA 


pMEVl 


DA26 


72 



Sas^les o£ a& isolate designated A4001 
(containing the cuticle/AalT gene under the control of 
the DA26 promoter inserted into the A4000 direct 

0 ligation virus vector) and o£ an isolate designated 

AlOOl (containing the cuticle/AalT gene under the 
control of the DA26 promoter inserted into the 6.2.1 
direct ligation virus vector) have been deposited by 
applicants with the American Type Culture Collection, 

5 12301 Parklawn Drive, Roclcville, Maryland 20852, 

U.S.A., on April 7, 1993, and have been assigned ATCC 
accession numbers VR-2405 (for A4001) and VR-2404 (for 
AlOOl) . 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-B31-3244 

(B) TELEFAX: 201-831-3305 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
CCTCA660CA 6CTTAAG6CA 6C66ACC66C AGCCTGCAGG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCTGCAGGCT GCCGGTCCGC TGCCTTAAGC TGCCCTGAGG 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLEOTLE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGATTTCCTT GAAGAGAGTG AG 
(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(Iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
ATGGCGGTTT TAACAGCCGT CGATTTAACT AATGCCAGTA GGTATGCCAT ACATATGCAT 60 

CGTCTCGAGG TCCCCTGCAQ GCTGCCGGTC CGCTGCCTTA AGCTGCCCT6 AGGACGGTAT 120 

CGATAAGCTT GATAACTCGA ATCGCTATCC AAGCCAGCTC CATTGTCGGC ATCGTGCTCA 180 

TTCTATTGAC GCTGGCAGAT TTGGTTTTGG CGCTATGGGA CCCGTTCGGT TACAACAACA 240 

TGTTTCCGCG CGAGTTTCCC OACGACATGT CGCGCACGTT CCTGACTGCG TACTTTGAGA 300 

GTTTCGACAA CACCACGTCC AGAGAAATCA TAGAGTTTAT GCCCGAGTTC TTTTCGGAAA 360 

TGGTCGAAAC GGACGATGAC GCCACGTTTG AATCTCTATT TCATTTATTA GATTATGTGG 420 

CATCTTTAGA AGTTAATTCC GACGGCCAAA TGTTAAACTT GGAGGAGGGT GATGAAATTG 480 

AGGATTTTGA CGAATCTACT TTGGTGGGGC AAGCGTTAGC CACTAGCTCG CTATACACTC 540 

GCATGGAGTT TATGCAGTAC ACGTTTAGGC AAAACACACT ATTGTCTATG AACAAAGAAA 600 

ACAACAATTT TAATCAAATA ATCATGGGTT TATTTGCAAC AAACACAATT GTGGCGTTTA 660 

CAGCATTTGT TATACACACA GAACTCATAT TTTTTATATT TTTCGTAATC TTCCTAATGA 720 . 

TCACATTTTA TTACATAATC AAAGAATCGT ACGAATATTA TAAAACAATT GATTTGTTAT 780 

TTTAA 785 

(2) INFORMATION FOR SEQ ID NO: 5: ^ 

" 'i.' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AMTI- SENSE: NO 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
NNNNNNNNNN NNNNNNN6CA 6GT 23 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single s: r 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
A6CA6CCTC6 AGCCTCAGGC CTAT6CC6T6 TCCAATTGCA AG 42 
(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 4; 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGACAGGATC CATTACCTGC AGGAGTTTAA ATTGTGTAAT TTATGTAG 48 
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(2) INFORMZITION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AGCAGCCTCG AGCCTCAGGC CTATGCCGTG TCCAATTGCA AGTTCAACAT TGAGGATTAC 
AATAACATAT TTAAGGTGAT GGAAAATATT AGGAAACACA GCAACAAAAA TTCAAACGAC 
CAAGACGAGT TAAACATATA TTTGGGAGTT CAGTCGTCGA ATGCAAAGCG TAAAAAATAT 
TAATAAGGTA AAAATTACAG CTACATAAAT TACACAATTT AAACTCCTGC AOQTAATGGA 
TCCTGTCC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GATCTAGAAT TCCATG6ATC CCGGGTACCA ACCAGACATT CCACACAGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
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CE) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
A6CAGCGA6C TCCT6CA66C CTCAAACACA G6CAAATATT GA 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
6ATCTAGAAT TCCATGGATC CCGGGTACCA ACCAGACATT CCACACAGCC GACAGTAGCG 
AATGAACGAA GCGATTTCGT CGCCTGCCCT CGTTTGGCTT TCGACTGTTA CAAAATCATG 
TCTGCAA6AT TTTAAAYTAA GCCCGCTAAG CTCAAATAGT TTATTTTTAT TACTGTTTT6 
TAAATAAATA ACTTTATCAT TCAATATTTG CCTGTGTTTG AGGCCTGCAG 6AGCTCGCTG 
CT 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: DNA (genoznlc) 
(ill) HYPOTHETICAL: NO 
(iv) AMTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGCAGCGAGC TCCTGCAGGC CTACGCGTAA TTCQATATAG AC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 
(111) HYPOTHETICAL: NO 
(Iv) ANTI-SENSE: NO 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGATCTAGA CACGTCTCGT TCGATGTTTC GCCTTTGAAC GT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 
(111) HYPOTHETICAL: NO 
(Iv) ANTI- SENSE: NO 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AGCAGCGAGC TCCTGCAGGC CTACGCGTAA TTCGATATAG ACATGACATC AGTCGTCAAT 
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TGTATTCAAA AACAACAATT GCT6CCAAT6 TACCGTATTC AAATTACTAC ATGTATAAAT 
CTGTGTTTTC TATTGTAATG AATCACTTAA CACACTTTTA ATTACGTCAA TAAATGTTAT 
TCACCATTAT TTACCTGGTT TTTTTGAGAG G66CTTTGTG CGACTGCGCA CTTCCA6CCT 
TTATAAACQC TCACCAACCA AAGCAGGTCA TTATTGTGCC AGGACGTTCA AAGGCGAAAC 
ATCGAACGAG ACGTGTCTAG ATCCG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AGCAGCGAGC TCCTGCAGGC CTCTTGATGT CTCCGATTTC 
(2) INFORMATION FOR SEQ ID N0:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) M0LECX7LE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGATCTAGA CACGTCTCGT TTGCTATGGT AAAGCTCAAA 
(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGCAGCGAOC TCCTOCAGQC CTCTTGACGT CTCCGATTTC TTTTTGGCGG CAATAAGCAC 
TCCAATGCAA ATACAAAACT TTGTCGCAAC TACTGATGTT TTCQATTTCA TTCTGAAATT 
GTTCTAAAGT TTGTAACGCG TTCTTGTTAA AGTAATAGTC CGAGTTTGTC GACAAGGAAT , 
CGTCGGTGGC GTACACGTAG TAGTTAATCA TCTT6TTGAT TGATATTTAA TTTTG6CGAC 
GQATTTTTAT ATACACGAQC 6QAGCGGTCA CQTTCTGTAA CATQAGTGAT CGTGTGTGTQ 
TTATCTCTGG CAGCGCSATA GTGGTCGCGA AAATTACACG CGCGTCGTAA CGTGAACGTT 
TATATTATAA ATATTCAACG TTGCTTGTAT TAAGTGAGCA TTTGAGCTTT ACCATAGCAA 
ACGAGACGTG TCTAGATCCG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHTU^ICTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGCAGC<5AGC TCCTGCAGGC CTATGCCGTG TCCAATTGCA AG 
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(2) INFORtUlTION FOR SBQ ID NO: 19: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGATCTAGA CACGTCTCGG TTTAAATTGT GTAATTTATG TA 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
A6CAGCGAGC TCCTGCAGGC CTATGCCGTG TCCAATTGCA AGTTCAACAT TGAGGATTAC 
AATAACATAT TTAAGGTGAT GGAAAATATT AGGAAACACA GCAACAAAAA TTCAAACGAC 
CAAGACGAGT TAAACATATA TTTGGGAGTT CAGTCGTCGA ATGCAAAGCG TAAAAAATAT 
TAATAAGGTA AAAATTACAG CTACATAAAT TACACAATTT AAACCGAGAC GTGTCTAGAT 
CCG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGCAGCGAGC TCCTGCAGGC CTGACGCACA AACTAATATC AC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGGATCTAGA CACGTCTCGA TTTATAGGTT TTTTTATTAC AA 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO; 23: 
AGCAGCGAGC TCCTGCAGGC CTGACGCACA AACTAATATC ACAAACTGGA AATGTCTATC 
AATATATAGT TGCTGATATC ATGGAGATAA TTAAAATGAT AACCATCTCG CAAATAAATA 
AGTATTTTAC TGTTTTCGTA ACAGTTTTGT AATAAAAAAA CCTATAAATC GAGACGTGTC 
TAGATCCG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AGCAGCGAGC TCCTGCAGGC CTGTCTTGGC TTTCCTCATA TGTACATATG TATGTAAATA 
TGTAAAATAA GTCGCAACTA AATTCTAATA CATTTTTCAG AATCTTAAAT TAATTTTATC 
GTATATTAAA ACAGAAGAAA GTCCGTTAAT AGTTQATTTC ATTAACTAAA AGTACAAAAT 
AATCTTTAAT ACATATGCCG ATCAGACATT TATTGGTTTA GAAGCGCAGT ATTTTTTTTG 
CGAATACGCA TAACAAAGCG CTTCGATTAT CTTTAACATA AGTTATTTAA GCAGCCGTAT 
TTATAAAGAA ATTTCCAAAA TAAAGCGAAT ATTCTAGAAT CCCAAAACAA ACTGGTTATT 
GTGGTAGGTC ATTTGTTTGG CAGAAAGAAA ACTCGAGAAA TTTCTCTGGC CGTTATTCGT 
TATTCTCTCT TTTCTTTTTG GGTCTCTCCC TCTCTGCACT AATGCTCTCT CACTCTGTCA 
CACAGTAAAC GGCATACTGC TCTCGTTGGT TCGAGAGAGC GCGCCTCGAA TGTTCGCGAA 
AAGAGCGCCG GAGTATAAAT AGAGGCGCTT CGTCTACGGA GCGACAATTC AATTCAAACA 
AGCAAAGTGA ACACGTCGCT AAGCGAAAGC TAAGCAAATA AACAAGCGCA GCTGAACAAG 
CTAAACAATC TGCAGTAAAG TGCAAGTTAA AGTGAATCAA TTAAAAGTAA CCAGCAACCA 
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AGTAAATCAA CTGCAACTAC TGAAATCTGC CAAGAAGTAA TTATTGAATA CAAGAAGAGA 



780 



ACTCTGAATA CTTTCAACAA 6TTACCGAGA AAGAAGAACT CACACACACG AGACGTGTCT 



840 



AGATCCG 



847 



(2) INFORMATION FOR SEQ ID NOt25: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 
ATTCCATGGA TCCCGG3TAC CTGTAACTAG TGCACTCAAC .40 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs >vr' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AGCAGCCTCG AGCCTCAGGC CTCCACATTG TCGACTTGCT 40 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
Uv) AMTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATTCCATGGA TCCCGGGTAC CTGTAACTAG TGCACTCAAC AAAAATGTAA TATTAAACAC 
AATTAAATAA ATGTTGAAAA TTTATTGCCT AATATTATTT TTGTCAGTTS STT6TCATTT 
ATTAATTTGG ATGATGTCCA TTTGTTTTTA AAATTGAACT 6GCTTTACGA CSTAGAATTCT 
ACGCOTAAAA CACAATCAAO TAYGAGTCAT AAGCTGATGT CATGTTTTGC ACACGGCTCA 
TAACCGAACT GGCTTTACGA GTAGAATTCT ACTTGTAACG CACGATCGAG TGGATGATGQ 
TCATTTGTTT TTCAAATCGA GATGAT6TCA TGTTTTGCAC ACGGGCTCAT AAACTCGCTT 
TACGA6TAGA ATTCTACQTG TAAC0CAC6A TCGATTOATO AOTCATTTGT TTTGCAATAT 
OATATCATAC AATATOACTC ATTTQTTTTT CAAAACCGAA CTTGATTTAC aGQTAOAATT 
CTACTYGTAA AGCACAATCA AAAAGATGAT OTCATTTOTT TTTCAAAACT OAACTCTCGG 
CTTTACGAGT AGAATTCTAC GTGTAAAACA CAATCAAQAA ATGATGTCAT TTGTTATAAA 
AATAAAAGCT 6ATGTCATGT TTTGCACATG GCTCATAACT AAACTCGCTT TACAAATAGA 
ATTCTACOCG TAAAACATGA TTQATAATTA AATAATTCAT TTGCAAAGCT ATACGTTAAA 
TCAAACG6AC 6TTATGGAAT TGTATAATAT TAAATATGCA ATT6ATCCAA CAAATAAAAT 
TRTAATAGA6 CAAGTCGACA ATOTGOAGGC CTGAGGCTCG AGGCTGCT 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ATGAAATTTC TCCTATTGTT TCTCGTAGTC CTTCCAATAA TGGGG6TGCT TGGC 54 
(2) INFORMATION FOR SEQ ID NOs29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ATGTTCAAGT TCGT6ATGAT CTGCGCCOTC CTCGGCCTGG CTGTGGCCAA GAAGAACGGC 60 
TACGCA6TCG ACTCATCCGG AAAA6CCCCC GAGTGCCTGC TCTCGAACTA TTGCAACAAT 120 
GAATGCACCA AGGTGCACTA CGCTGACAAG GGCTACTGTT GCCTTCTGTC CTGCTATTGC 18Q 
TTCGGTCTCA ACGACGACAA GAAAQTTCTG QAAATCTCTG ATACTCGCAA GAGCTACTGT 240 
GACACCACCA TCATTAACTA AGGATCCTTT CCTGGGACCC GGCAAGAACC AAAAACTCAC 300 
TCTCTTCAAG GAAATCC 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GAGCTCCT6C AGGCCTACGC GTAATTCGAT ATAGACATGA CATCAGTCGT CAATTGTATT 
CATTAAAAAC AACAGCTGCC AATGTACCOT ATTCAAATTA CTACATGTAT AAATCTOTGT 
TTTCTATT6T AATGAATCAC TTAACACACT TTTAATTACG TCAATAAATG TTATTCACCA 
TTATTTACCT 60TTTTTTT6 AGAGGGGCTT TGTGCGACTG COCACTTCCA GCCTTTATAA 
AC6CTCACCA ACCAAAGCA6 GTCATTATTG TGCCAGGACG TTCAAAGQC6 AAACATCGAA 
ATGTTCAAGT TC6TGATGAT CTGCGCCGTC CTCG6CCTGG CTQTG6CCAA GAAGAACGGC 
TACGCAGTCG ACTCATCCGG AAAAGCCCCC GAGTGCCTGC TCTCGAACTA TTQCAACAAT 
GAATGCACCA AGGTGCACTA CGCTGACAAG GGCTACTGTT GCCTTCT6TC CTGCTATTGC 
TTCGGTCTCA ACQACGACAA GAAAQTTCTG GAAATCTCTO ATACTCGCAA QA6CTACTQT 
OACACCACCA TCATTAACTA AOGATCCCGG GTACCAACCA GACATTCCAC ACA6CC6ACA 
GTAGCQAATG AACGAAGCGA TTTCGTC6CC TGCCCTCGTT TGQCTTTCGA CTGTTACAAA 
ATCAT6TCTG CAAQATTTTA AACTAAGCCC GCTAAOCTCA AATAGTTTAT TTTTATTACT 
6TTTTGTAAA TAAATAACTT TATCATTCAA TATTTGCCTG TQTTTGAGGC CTGAGGCTCO 
A0GG6GG6CC C6GTACC 
(2) INFOKMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO " ' 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CCATGATTAC OCCAAGCGCG 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ATGAAACTCC TGGTCGTGTT CGCCATGTGC GTGCCCGCTG CCAGCGCT 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
ATGAAACTTC TCGTTGTGTT CGCAATGTGC GTGCCTGCCG CCAGCGCC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL; NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
ATGTACAAAC TGACCGTCTT CCTOATGTTC ATCGCCTTCG TGATTATCGC TGAGGCC 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ATGTACAAGC TCACAGTCTT CCTGATGTTC ATCGCTTTCG TCATCATCGC TGAGGCC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) M0LECX7LE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
ATGGCCGCTA AATTCGTCGT GGTTCTGGCC GCTTGCCTTCG CCCTGAGCCA CTCGGCTAT6 
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6T6C6CC0C 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ATGGCAGCCA AGTTCGTCGT GGTTCTCGCC GCGTGCGTQG CCCTCTCGCA CAGCGCOATG 
GTGCGCCGC 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

ATGTTCACCT TCGCTATTCT GCTCCTGTGC GTGCAAGGCT GCCTGATCCA GAATGTTTAC 
GGA 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AMTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ATGTTTACCT TCGCTATTCT CCTTCTCTGC GTTCAGGGTT GCCTGATCCA AAATGTGTAC 
GGT 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ATGTTCAAGT TTGTCATGAT CTGCGCAGTT TTGGGCCTGG CGGTGGCC 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(Iv) ANTI-SSKTSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
ATGAACTACG TCGGGCTGGG CCTCATCATT GTGCTGTCGT GCTTGTGGCT GGGGAGCAAT 
OCT 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
ATGAACTACG TGGOACTGGG ACTTATCATT GTGCTGAGCT GCCTTTGGCT CGQTTCGAAC 
GCG 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



wo 94/28114 



PCT/US94/06079 



- 86 - 



AT6C6CGTCC TG6T6CTGTT GGCCTGCCTG GCAGCC6CTA GC6CT 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
ATGAGGGTTC TAGTACTACT GGCCTGCTTG GCCGCGGCGT CAGCC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
AAGAAGAATG GATATGCCGT CGATAGTAGT GGTAAAGCTC CTGAATOTCT TTTGAGCAAT 
TACTGTAACA ACGAATGCAC AAAAGTACAT TATGCTGACA AAGGATATTG CTGCTTACTT • 
TCATGTTATT GCTTCGGTCT AAATGACGAT AAAAAAGTTT TGGAGATTTC GGACACAAGG 
AAAAGTTATT GTGACACCAC AATAATTAAT TAA 
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What Is claimed is: 

1. A recombinant double stranded DNA 
insect virus comprising: 

(a) a double stranded DNA insect virus into 
which is inserted at least one 
recognition site for a restriction 
endonuclease which does not cut the 
viral genome to create a direct 
ligation virus vector; and 

(b) a DNA fragment containing termini 
selected such that when the direct 
ligation virus vector of (a) is cleaved 
by the appropriate restriction 
endonuclease, the DNA fragment is 
ligated in vitro into the direct 
ligation virus vector of (a) • 

2. The recombinant double stranded DNA 
insect virus of Claim 1 wherein the virus is selected 
from the group consisting of double stranded enveloped 
DNA insect viruses from the Subfamilies 
Entomopoxvirinae , Exibaculovirinae . Nudibaculovirinae : 
Ichnovirug and Bracovirug . and double stranded 
nonenveloped DNA insect viruses from the family 
Iridoviridae ^ 

3. The recombinant double stranded DNA 
insect virus of Claim 2 wherein the virus is the 
Autoqrapha calif omlca nuclear polyhedrosis virus 
(AcMNPV) . 

4. The recombinant double stranded DNA 
insect virus of Claim 1 wherein two recognition sites 
for a restriction endonuclease which does not cut the 
viral genome are inserted into the viral genome to 
create a direct ligation virus vector. 

5. The recoooobinant double stranded DNA 
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insect virus of Claim 1^ wherein the DNA fragment is 
inserted at any region of the viral genome which is 
nonessential for viral replication in cultured cells. 

6. A modular es^ression vector which 
cosqprises a plasmid vector containing a virus 
insertion module which comprises in the following 
order : 

(a) a recognition site for a restriction 
endonuclease ; 

(b) a promoter module containing a promoter 
and a 5' untranslated region (UTR) , 
where the 5' UTR extends from the 
transcription start site to the last 
base pair vAiich precedes the 
treuislation initiation codon for 
protein synthesis; 

(c) a polylinker module to facilitate 
insertion of a heterologous gene; 

(d) a 3' UTR module containing . at least a 
site for 3' terminal mRNA processing 
and polyadenylation; and 

(e) a recognition site for a restriction 
endonuclease , 

such that the recognition sites of (a) and (e) permit 
the ligation in vitro of the virus insertion module 
into a direct ligation virus vector. 

7. The modular eajpression vector of Claim 
6 wherein: (a) the restriction endonuclease 
recognition site is selected from the group consisting 
of Bsp MI and Esp 31 recognition sites; (b) the 
promoter and 5' UTR of the promoter module are 
selected from the group consisting of the heterologous 
promoter and 5' UTR from the AcMNPV 6.9K gene, the 
AcMNPV DA26 gene, the AcMNPV polyhedrin gene, the 
AcMNPV 3SK gene and the Drosonhila melanoaaster hsp70 
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gene; and (c) the 3' UTR module is selected from the 
group consisting of the (a) 3' UTR of the AcMNPV 6,9K 
gene and, (b) a region coinprising the 3' terminus of 
the 35K gene together with the AcMtTPV homologous 
region 5 (b^S) • 

8. The modular expression vector of Claim 
6 wherein the poly linker module is altered by the 
insertion of a nucleic acid sequence encoding a 
heterologous protein • 

9. The modular expression vector of Claim 
8 wherein the toxin is AalT. 

10. The modular expression vector of Claim 
8 wherein the nucleic acid sequence encoding a 
heterologous signal sequence is selected from the 
group consisting of the cuticle signal sequence from 
DrosQphila melanooaflfcer- the chorion signal sequence 
from Bombyx mopj, the apolipophorin signal sequence 
from Manduea sexta, the sex specific signal sequence 
from Bombvx mori, the adipokinetic hormone signal 
sequence from Manduea sexta, the pBMHPC--12 signal 
sequence from Bombvx mori and the esterase -6 signal 
sequence from Drosonhila melanooaster . 

11. A recombinant double stranded DNA 
insect virus comprising: 

(a) a double stranded DNA insect virus into 
which is inserted two recognition sites 
for a restriction endonuclease which 
does not cut the viral genome to create 
a direct ligation virus vector; and 

(b) a DNA fragment containing termini 
selected such that when the direct 
ligation virus vector of (a) is cleaved 
by the appropriate restriction 
endonuclease, the DNA fragment is 
ligated in vitro into the direct 
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ligation virus vector of (a) , wherein 
the DNA fragment contprises a virus 
insertion module which comprises in the 
following order: 

(1) a recognition site for a restriction 
endonuclease; 

(2) a promoter module containing a promoter 
and a 5' untranslated region (UTR) , 
where the 5' UTR extends from the 
transcription start site to the last 
base pair which precedes the 
translation initiation codon for 
protein synthesis; 

(3) a polylinker module altered by the 
insertion of a nucleic acid sequence 
encoding a heterologous protein; 

(4) a 3' UTR module containing at least a 
site for 3 ' terminal mRMA processiiig ' 
and polyadenylation; and 

(5) a recognition site for a restriction 
endonuc 1 ease . 

12. A direct ligation virus vector 
comprising a double stranded DNA insect virus into 
which is inserted at least one recognition site for a 
restriction endonuclease which does not cut the viral 
genome, such that when the direct ligation virus 
vector is cleaved by the appropriate restriction 
endonuclease, a DNA fragment is ligated in vitro into 
the direct ligation virus vector. 

13 . The direct ligation virus vector of 
Claim 12 wherein the virus is selected from the group 
consisting of double stranded enveloped DNA insect 
viruses from the Subfamilies Entomopoxvir inae . 
Eubaculovir inae . Nudibaculovirinae . Ichnovirus and 
Bracovirus, and double stranded nonenveloped DNA 
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insect viruses from the family Iridoviridae , 

14. The direct ligation virus vector of 
Claim 13 wherein the virus is the Autoqrapha 
califomica nuclear polyhedrosis virus 

(AcMNPV) • 

15. The direct ligation virus vector of 
Claim 12 wherein two recognition sites for a 
restriction endonuclease which does not cut the viral 
genome are inserted into the viral genome. 

16. The direct ligation virus vector of 
Claim 12, wherein the DNA fragment is inserted at amy 
region of the viral genome which is nonessential for . . 
viral replication in cultured cells. 
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Figure 2 





Trinucleotide Frequencies in AcMNPV DNA (n = 30.866) 




A 


0 


G 


T 


AA 
AC 
AG 
AT 


0.0434 
0.0219 
0.0103 
0.0196 


0.0228 
0.0086 
0.0119 
0.0150 


0.0132 
0.0183 
0.0050 
0.0158 


0.0300 
0.0121 
0.0120 
0.0300 


CA 
CC 
CG 
CT 


0.0274 
0.0115 
0.0154 
0.0092 


0.0131 
0.0056 
0.0152 
0.0098 


0.0087 
0.0099 
0.0111 
0.0103 


0.0166 
0.0063 
0.0201 
0.0142 


GA 
GC 
GG 
GT 


0.0158 
0.0157 
0.0078 
0.0139 


0.0113 
0.0103 
0.0116 
0.0142 


0.0080 
0.0161 
0.0057 
0.0146 


0.0140 
0.0120 
0.0098 
0.0237 


TA 
TC 
TG 
TT 


0.0228 
0.0167 
0.0156 
0.0228 


0.0136 
0.0088 
0.0154 
0.0171 


0.0093 
0.0176 
0.0131 
0.0279 


0.0197 
0.0130 
0.0244 
0.0399 1 
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Tetranudeotide Fraquendes in AcMNPV DNA (na30.e59) 




A 


C 


G 


T 


AAA 
AAC 

AAQ 
AAT 


0.01629 
0.00B42 
0.00356 
0.00839 


0.00946 
0.00239 
0.0Q388 
0.0QS63 


0.00554 
0.00670 
0.00168 
0.00541 


0.01215 
0.00534 
0.00411 
0.01059 


AwA 

ACC 

ACQ 
ACT 


0.00949 

0.00473 
0.00285 


0.00414 
0.00162 
0.00505 
0.00294 


0.00256 
0.00243 
0.00314 
0.00272 


0.00570 
6.00152 
0.00541 
0.00362 


AQA 
AGC 
AGG 
AGT 


0.00382 
0.00337 
0.00110 
0.00301 


0.00223 
0.00210 
0.00168 


0.00168 
0.00366 
0.00084 
0.00210 


0.00265 
0.00278 
0.00139 
0.00469 


ATA 
ATC 
ATG 
ATT 


0.00748 
0.00492 
0.00453 
0.00686 


0.00346 
0.00200 
0.00317 
0.00440> 


0.00281 
0.00453 
0.00262 
0.00657 


0.00586 
0.00359 
0.00547 

0.01218 


CAA 
CAC 
CAG 
CAT 


0.01 101 
0.00359 
0.00210 
0.00382 


0.00573 
0.00230 
0.00324 
0.00353 


0.00405 
0.00482 
0.00116 
0.00398 


0.00667 
0.00239 
0.00226 
0.00531 


CCA 
CCC 
CCG 
CCT 


0.00463 
0.00210 
0.00235 
0.00132 


0.00256 
0.00077 
0.00265 
0.00174 


0.00139 
0.001GS 
0.00165 
0.00126 


0.00298 
0.00106 
0.00281 
0.00197 


CGA 
CQC 
CGG 

CGT 


0.00382 

ojooaee 

0.00207 
0.00301 


0.00408 
0.00353 
0.00398 
0X0512 


0.00278 
0.00398 
0.00220 
0.00S18 


0.00473 
0.00411 
0.00291 


CTA 
CTC 
CTG 
CTT 


0.00265 
0.00265 
0.00217 
0.00132 


0.00236 
0.00191 
0.00262 
0.00288 


0.00106 
0.00320 
0.00210 
0.00453 


0.00314 
0.00207 
0.00340 
0.00544 


GAA 
GAC 
GAG 
GAT 


0.00593 
0.00385 
0.00191 


0.00372 
0.00197 
0.00230 
0.00291 


0.00210 
0.00343 
0.00129 
0.00301 


0.00411 
0.00210. 
0.00249 
0.00SS4 


GCA 
QCC 
GCG 
QCT 


0.00580 
0.00301 
0.00330 


0.00366 
0.00184 
0.00421 


O.0O288 
0.00369 
0.00343 


0.00343 
0.00181 
0.00518 


GGA 
GGC 
GGG 
QGT 


0.00288 
0.00333 
0.00123 
0.00178 


0.00194 
QJ0O%97 
0.00191 

0.00233 


0.00119 
0.00440 
o!00097 
0.00304 


0.00178 

aooi68 

0.00S6a 


GTA 
QTC 
GTG 
QTT 


0.00424 
0.00385 
0.00220 
0.00450 


0.00395 
0.00243 
0.00359 


0.00233 
0.00537 
0.00337 
O.00657 


0.00340 
0.002S9 
0.00550 
0.00868 


TAA 
TAC 
TAG 
TAT 


0.01020 
O.00G02 
0.00281 
0.00479 


0.00395 
0.00194 
0.00249 
0.00294 


0.00155 
0.00337 
O.00O87 
0.00340 


0.00709 
0.00233 
0.00320 
• 0.00861 


TCA 
TCC 
TCG 
TCT 


0.00755 
0.00340 
0.00453 
0.00304 


0.00275 
0.00136 
0.00337 


0.00194 
0.00220 
0.00294 
0.00294 


0.00453 
0.00191 
0.00677 
0.00466 


TGA 
TGC 
TGG 
TGT 


0.00S34 
0.00541 
0.00340 
0.00612 


0.00311 
000275 
0.00411 
0.00453 


0.00230 
0.00408 
0.00178 
0.00434 


0.00492 
0.00320 
0.00385 
0.00946 


TTA 
TTC 

TTG 
T7T 


0.00839 
0.00534 
0.00677 
0.01014 


0.00368 
0.00252 
0.00605 
0.00596 


0.00317 
0.00450 
0.00502 
0.01024 


0.00735 
0.00479 
0.01011 
0.01364 
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Figure 4 



EcoRI 



polyhetkin 
coding region 




Sse8387l RsrII Aflll Bsu361 

CCTGCAGGCTGCCGGTCCGCTGCCTTAAGCTGCCCTGAGG 
GGACGTCCGACGGCCAGGCGACGGAATTCGACGGGACTCC 
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Figures 



M AcMNPV 
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AGATTTGGTTTTGGCGCTATGGGACCCGTTCGGTTACAACAACATGTTTCCGCGCGAGTTTCCC.GACGA 
CATGTCGCGCACGTTCCT6ACT6CGTACTTTGAGAGTTTC6ACAACACCACGTCCAGA6AAATCATAGA 

GTTTATGCCCGAGTTCTTTTCGGAAATGGTCGAAACGGACGATGACGCCACGTTTGAATCTCTATTTCA 
TTTATTAGALTATGTGGCATCTTTAGAAGTTAATTCCGACG GCCAAATG TTAAArTtfiRAr.n.ceCT.>A 

TGAAATTGA0GATTTTGAC6AATCTACTTT66TGGGGCAAGCGTTAGCCACTAGCTCGCTATACACTCG 

CATGGAGTTTATGCAGTACACGTTTAGGCAAAACACACTATTGTCTAT6AACAAAGAAAACAACAATTT 

TAATCAAATAATCATG8GTTTATTTGCAACAAACACAATT6T6GCGTTTACAGCATTTGTTATACACAC 

AGAACTCATATTTTTTATATTTTTCGTAATCTTCCTAATGATCACATTTTATTACATAATCAAAGAATC 
GTACGAATATTATAAAACA ATTGATTT6TTATTTTAA 
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Figure? 



6.9K GENE 



ATO 



PGR ampSfication 



I StuI 



BspMI 



Sse8387l 
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PCR AMPLIRCATION OF THE AcMNPV 6.9K GENE PROMOTER 
FOR Bsp Mi-BASED MODULAR EXPRESSION VECTORS 



PRIMER SET: 



Strand 


Primer 


Sequence 


(+) strand 


NWoligol 


5' - AGCAGCCTCGAGCCTCAGGCCTATGCCGTGTCCAATTGCAAG - 3* 


H strand 


PDoiigo 23 


5* - GGACAGGATCCATTACCTGCAGGAGTTTAAATTGTGTAATnATGTAG - 3* 



TEMPUTE: AcMNPV Hind III fragment "H" 
AMPUFIED FRAGMENT: 



Xhol 



StuI 

I 1 i 



-190 
1 



-180 

I 



-170 
I 



-160 
I 



-150 
I 



aQcagcctcgagcctcaQOcCTATGC CGTGTCCAAnGCAAG TTCAACATTGAGGATTAr!AATAArATATTTAAr.fiTft*T 
tcgtcggagctcggagtccgGATACGGCACAGGTTAACGnCAAGTTGTAACTCCTAATGTTATTGTATAAATTCCACTA 



-140 -130 -120 -110 -100 -90 -80 -70 * 

I I I I I f 1 1 

GGAAAATAnAGGAAACACAGCAACAAAAAnCAAACGACCAAGACGACnAAACATATATTTGGGAGTTCAGTCGTCGA 

CCTmATAATCCmGTGTCGnCTTmAAGmGCTGGnCTGCTCAAmGTATATAAACCCTCAAGT^^^ 



-1 BspMI BamHI 



-60 -50 -^0 -30 -20 -10 

1 I 1 I I 1 I 

ATGCAAAGCGTAAAAAATAnAATAAGGTAAAAAnACAGCTACATAAAnACACAAmAAACtcctgcaggtaatggatcctgt^^^ 
TACGmCGCATmnATAAnAnCCATTmAATGTC GATGTATTTAATGTGTTAAAmGaggacotCCatt^ .-^ : 

. 
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PCR AMPLIFICATION OF THE AcMNPV 6.9K GENE 3' - UTR 
FOR Bsp MI-BASED MODULAR EXPRESSION VECTORS 



PRIMER SET: 



Strand 


Primer 


Sequence 


(+) strand 


NWoligo2 


5' - GATCTAGAATTCCATGGATCCCGGGTACCAACCAGACAnCCACACAGC - 3* 


H strand 


NWoligoS 


5* - AGCAGGGAGCTCCTGGAGGCCTCAAACACAGGCAAATAnGA - 3* 



TEMPLATE: AcMNPV Hind III fragnnent-H" 
AMPURED FRAGMENT: 

Xbal Ncol Smal 
I ^ I ^ I 1 

gatctagaattccatggatcccgggtacCAACCAGACATTCCACACAGC CGACAGTAGCGAATGAACGAAGGGATTTCGT 

ctagatcttaaggtacctagggcccatgGTTGGTCTGTAAGGTGTGTCGGCTGTCATCGCTTACTTGCTTCGCTAAAGCA 

I II II I 

EcoRI BamHI Kpnl 

CGCCTGCCCTCGmGGCmCGACTGnACAAAATGATGTCTGCAAGAnnAAAYTAAGCCCGCTAAGCTGA^^ 
GCGGACGGGAGCAAACCGAAAGCTGACAATGTmAGTACAGACGnCTAAAAnTRAnCGGGCGATrCGAGTm^^^ 

StuI SstI 

I — I rn 

nATnTTATTACTGTTTTGTAAATAAATAACTTTATCATTCAATATnGCCTGTGTnGAggcctgcaggagctcgctgct 

AATAAAAATAATGACAAAACAmATnAnGAAATAGTA AGnATAAACGGAGACAAACTccggacgtcctcgagcgacga 

I I ~ 

Sse 83871 - ■ . C 
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Figure 11 



PGR AMPLIFICATION OF THE AcMNPV DA26 GENE PROMOTER 
FOR Esp 31-BASED MODULAR EXPRESSION VECTORS 



PRIMER SET: 



Strand 


Primer 


Sequence . 


M strand 


OA26FZ 


5* - AGCAGCGAGCTCCTGCA6GCCTACGCGTAATTCGATATAGAC - 3' 


H strand 


0A26RZ 


5' - CGGATCTAGACACGTCTCGTTC6AT6TTTCGCCTTTGAAC6T - 3' 



TEMPLATE: AcMNPV Pst I fragment 'G' 

AMPUFIED FRAGMENT: 

SstI StuI -280 -270 -260 -250 -240 -230 

I II II I I I I I 

agcaBCoagc^cctocaooc ctACGCGTAATTCGATATAfiAC ATRAnATftaRTfiftTnAATT^.T/lTTrAAA»«>.A»^^^T^ 

tcgtcgctcgaggacotccooaTGCGCAnAAGCTATATCTGTACTGTAGTCAGCAGTTAACATAAGTTTTTGYYGnAA 



Sse8387l 



-220 -210 -200 -190 -180 -170 -160 -ISO 

< I I I I I I I 

GCTGCCAATGTACCGTATTCAAAnACTACATGTATAAATCTGTGTTTTCTAnGTAATGAATCACTTAACACACTTTTA 
CGACGGTTACATGGCATAAGTrTAATGATGTACATAmAGACACAAAAGATAACAmcnAGTGAAnGTGTGAAAAT 

-MO -130 -120 -110 -100 -90 -80 .711 

» I I I I I I I 

AnACGTCAATAAATGnAnCACCATTATnACCTGGTTTTTnGAeAGGGGCTTTCTGCGACTGCGCACTTCCAGCCT 
TAATGCAGTTATTTACAATAAAGT66TAATAATGGACCAAAAAAACTCTCCCCGAAACACGCTGACGCGT6AAGGTCGGA 

-60 -50 -40 -30 -20 -10 -1 . - , -'V 

I I I I I I I . : - :■ 

nATAAACGCTCACCAACCAAAGCAGGTCAnATTGTGCCAGGACGnCAAAGGCGAAACATCGAAcgagacgtgtctaoafccg-- 
ATArrTGCGAGTGGnGGTnCGTCCAGTAATAACACGGTCC TGCAAGTTTCCGCTTTGTAGCTTQCtctQcacaQatctaffq c 
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Figure 12 



PCR AMPLIRCATION OF THE AcMNPV 35K GENE PROMOTER 
FOR Esp 31-BASED MODULAR EXPRESSION VECTORS 

PRIMER SET; 



Strand 


Prlfner 


Sequence 


(f ) strand 


35KPR01 


5* » AGCA6CGAGCTCCTGCAGGCCTCTTGATGTCTCCGATTTC - 3' 


(-) strand 


35KPR02 


5' - CGGATCTAGACACGTCTCGTTTGCTATGGTAAAGCTCAAA - 3* 



TEMPLATE: AcMNPV Hind HI fragment "K" 
AMPUnED FRAGMENT: 

SstI StuI -390 "380 -370 -360 -350 

I I rn II I II 

aocaQCQaoctcctoca QocCTCrrGACGTCTCCGATTTCT TTTTGGCGGCAATAAGCACTCCAATfifiAAATArAAAArT 

tcgtcgctCfla pgacgtcc gGAGAACTGCAGAGGCTAAAGAAAAACCGCCGTTAnCGTGAGGmCGTTTAtGnTtG 

Sse8387l 

-340 -330 -320 -310 -300 -290 -280 -270 

111(1) II 
nGTCGCAACTACTGATGTITrCGAmCAnCTGAAATTGTTCTAAAGTTTGTAACGCGTTCTTGrrAAAGTAATAGTC 
AACAGCGTTGATGACTACAAAAGCTAAAGTAAGACnTAACAAGArrTCAAACAnGCGCAAGAACAAmCAnATCAG 
-260 -250 ^40 -230 -220 -210 ^200 -190 

II I f I I 1 I 

CGAGnTGTCGACAAGGAATCGTCGGTGGCGTACACGTAGTAGnAATCATCnGTrGAnGATAmAATnTGGCGAC 
GCTGAAACAGCTGnCCmGCAGCCACCGCATGTGCATCATCAATTAGTAGAACAACTAACTATAAAnAAAACCGCTG 
-180 -170 -160 -150 -140 -130 -120 -110 

I I.I I I I I I . * 

GGATnTTATATACAC6A6CG6A6C6GTCAC6nCTGTAACAT6A6T6ATC6T6TGT6TGrrATCTCT66CA6C6CGATA 
CCTAAAAATATA6TGCTC6CCTCGCCAGTGCAAGACATTGTACTCACTAGGACACACACAATA6AGACCGTC6CGCTAT 
-100 -90 -80 -70 -60 -50 ^0 -30 

1 I I I 1 I.I I 

GTGGTCGCGAAAAnACACGCGCGTCGTAACGTGAACGnTATAnATAAATAnCAACGnGCnGTAnAAGTGAGCA 

CACCAGCGCTmAATGTGCGCGCAGCAnGCACnGCAAATATAATArrTATAAGnGCAACGAACATAAnCACTCGT 
-60 -50 -40 

III. 
TTTGAGCrrTACCATAGCAAAcgagacgtgtctagatccg 
AAACTCGAAATGGTATCGTTTo|!tctQ<jac|gatc|annc 

Esp3l Xbal 
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Figure 13 



PCR AMPLIFICATION OF THE AcMNPV 6.9K GENE PROMOTER 
FOR E$p 31-BASED MODULAR EXPRESSION VECTORS 



PRIMER SET: 



Strand 


Primer 


Sequence 


{*) strand 


69KFZ 


5' - AGCAGCGAGCTCCTGCAGGCCTATGCCGTGTCCAATTGCAA6 - 3' 


H strand 


69KRZ 


5' - CGGATCTAGACACGTCTCGGTTTAAATTGTGTAATTTATGTA - 3* 



TEMPLATE: AcMNPV Hind ill fragment "H" 

AMPUFIED FRAGMENT: 

SstI Stui -190 -180 -170 -160 -150 

I II 1 I I I I I. 

aocaflcgagctcctQCaogcCTATGCCGTGTCCAATTGCAAG nCAACATTGAGGATTACAATAACATATTTAAGGTGAT 

tcgtcgctcgaogacgtccgGATACGGCACAGGnAACGnCAAGnGTAACTCCTAATGTTATTGTATAAATTCCACTA 

I 1 

Sse8387l 

-140 -130 -120 -110 -100 -« -80 -70 

>< I I I I I I 

CGAAAATATrAGGAAACACAGCAACAAAAAnCAAACGACCAAGACGAGTTAAACATATATTTGGGAGnCAGTCGTCGA 
CCnTTATAATCCTnGTGTCGnGTrrrTAAGTTTGCTGGTTCTGCTCAAnTGTATATAAACCCTCAAGTCAGCAGCT 

-60 -50 -40 -30 -20 -10 -1 

< II I 111 

ATGCAAAGCGTAAAAAATATTAATAAGGTAAAAATTACAGCTACATAAAnACACAATTTAAACcgagacgtgtcfagatccg 
TACGnTCGCAnmTATAAnATrCCATTnTAATGTCG ATGTATnAATGTGTTAAArnrGgctct gcacaflatctaflQC 

L_l L_J — "■ 

EspSI Xtal. ;• . 
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PGR AMPLIRCATION OF THE AcMNPV POLYHEDRIN GENE PROMOTER 
FOR Esp 31-BASED MODULAR EXPRESSION VECTORS 



PRIMER SET: 



Strand 


Primer 


Sequence 


strand 


PHF 


5' - AGCAGCGAGCTCCTGCAGGCCTGACGCACAAACTAATATCAC - 3V 


H strand 


. PHR 


5' - CGGATCTAGACACGTCTCGATTTATAGGTTTTTnATTACAA - 3' 



TEMPLATE: AcMNPV Hind III fragment 'F* 
AMPLIFIED FRAGMENT: 

SstI StuI -^40 -130 -120 -110 -100 -90 

I ^ I 1 ' ' » I ^ 

agcagcoaqctcctgcaflOCCtGACGCACAAACTAATATCAC AAACTGGAAATGTCTATCAATATATAGnGCTGATATC 

tcgtcgctcgaggacgtccggaCTGCGTGTnGATTATAGTGmGACCTnACAGATAGTTATATATCAACGACTATAG 



Sse8387l 

-80 -70 -60 -50 -40 -30 -20 -10 

I .1 I I I i t I 

ATGGAGATAATTAAAATGATAACCATCTCGCAAATAAATAAGTATTnACTGTinCGTAACAGTTnGTAATAAAAAAA 

TACCTCTATTAATTnACTAnGGTAGAGCGTnATnAnCATAAAATGACAAAAGCAnGTCAAAACATTATrmTT 



-1 
I 

CCTATAAATcgagacgtgtctagatccg 
GGATATTTAgctctgcacagatctaggc 

i_j i_j 

Esp3l Xbal 
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PGR AMPLIFICATION OF THE DROSOPHILA hsp70 GENE PROMOTER 
FOR Esp 31-BASED MODULAR EXPRESSION VECTORS 

PRIMER SET: 



Strand 


Primer 


Sequence 


(•*-) strand 


OMHSP70F 


5' - AGCAGCGAGCTCCTGCAGGCCTGTCTT66CTTTCCTCATATGTAC - 3* 


H strand 


0MHSP70R 


5' - CGGATCTAGACACGTCTCGTGTGTGTGAGTTCnCTTTCTCGG - 3' 



TEMPLATE: Dmsophila melanogasterlQcus 8701 

AMPLIFIED FRAGMENT: 

SstI Stui -flOO -790 -780 -770 -7B0 tm 

I II II I I « »0 .750: 

aBCagcflanctCCtOcaQOCCtGTCnGGCTT TCCTCATATGTAC ATATRTATRTAAi>T>Tr;Ti>».«T««^ 

CAGCGnGAT 



tCOtCOCtCOagoaegtCCflOaCAGAACCBAAAGGAGTATACATGTATACATACATnATACATTTTAni 



Sse8387l 



-740 -730 -720 -710 -700 -690 -680 ^ 

' I ' I I I I I 

AAnCTAATACATTTTTCAGAATCTTAAAnAATTnATCGTATAnAAAACAGAAGAAAGTCCGTTAATAGTTGATnC 
nAAGATTATeTAAAAAGTCnAGAATnAAnAAAATAGCATATAATTTTGTCnCTTTCAGGCAATTATCAACTAAAG 

-660 -650 -640 -«0 -620 ^HO -600 t,: --jaa 

' ' ' I I I I ■ I 

AnAACTAAAAGTACAAAATAATCrrTAATACATATGCCGATCAGACAmAnGGTTTAGAAGCGCAGTATTTrrTTTG 
TAAnGATTTTCATGTTTTATTAGAAAnATGTATACGGCTAGTCTGTAAATAACCAAATCTTCGCGTCATAAAAAAAAC 

-580 -670 -560 -550 -540 -530 -520 -510: " 

I i I I I I I ,| • 

CGAATACGCATAACAAAGCGCnCGAnATCmAACATAAGTTArrTAAGCAGCCGTATnATAAAGAAATTTCCAAAA 
GCnATGCGTAnGmCGCGAAGCTAATAGAAAnGTATTCAATAAATTCGTCGGCATAAATArrTCTTTAAAGGTTTT 
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-500 ^90 ^80 ^70 -460 -450 -440 <-430 

I I 1 I i I II 

TAAAGCGAATAnCTAGAATCCCAAAACAAACTGGTTATTGTGGTAQGTCAnTGTnGGCAGAAAGAAAACTCGAGAAA 

AmCGCTTATAAGATCnAGGGTmGmGACCAATAACACGATCCAGTAAACAAACCGTCTTTCTTn 

-420 -410 -400 -390 -380 -370 -360 -350 

i I I f 1 I I 1 

TTTCTCTG6CCGTTATTCGTTATTGTCTCTTTTCTTTTTGGGTCTCTCCCTCTCTGCACTAATGCTCTCTCACTCTGTCA 
AAAGAGACCGGCAATAAGCAATAAGAGAGAAAAGAAAAAGCCAGAGAGGGAGAGACGTGAnACGAGAGAGTGAGACAGT 

-340 -330. -320 -310 -300 -290 -280 -270 

i I I I I I I I ' 

CACAGTAAAGGGCATAGTGCTCTCGnGGnCGAGAGAGCGCGCCTCGAATGTTCGCGAAAAGAGCGCCGGAGTATAAAT 
GTGTCATTTGCCGTATGACGAGAGCAACCAAGCTCTCTCGCGCGGAGCnACAAGCGCTTnCTCGCGGCCTCATAmA 

-260 -250 -240 -230 -220 -210 -200 -190 

I ' I f I I I 1 I 

AGAGGCGCnCGTCTACGGAGCGACAAnCAATTCAAACAAGCAAAGTGAACACGTCGCTAAGCGAAAGCTAAGCAAATA . 
. TCTCCGCGAAGCAGATGCCTCGCTGTTAAGnAAGTTTGTTCGTTTCACnGTGCAGCGAnCGCmCGATTCGmAT 

-180 -170 -160 -150 -140 -130 -120 -110 - 

I I 1 I I I I I 

AACAAGCGCAGCTGAACAAGCTAAACAATCTGCAGTAAAGTGCAAGTTAAAGTGAATCAAnAAAAGTAACCAGCAACCA 
TTGTTCGCGTCGACnGnCGAmGmGACGTCAmCACGTTCAATTTCACTTAGTTAATTTTCATTGGTCGnGGT' 

-100 -90 -80 -70 -60 -50 -40 -30 

i 1 1 II I I I- 

AGTAAATCAACTGCAACTACTGAAATCTGCCAAGAAGTAAnAnGAATACAAGAAGAGAACTCTGAATACmCAACAA,^^ 
TCAmAGTTGACGnGATGACmAGACGGncnCAnAATAACnATGncnCTCTTGAGACnATGAAAGTTGTT 

-20 -10 -1 

i I I 

GnACCGAGAAAGAAGAACTCAGACACAcoagacototctagatccg 
CAAT GGCTCTTTCTTCTTGAGTGTGTGTQctctgcacagatctaggc 

I I I I 

Esp3l Xbal 
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PCR AMPLIFICATION OF THE AcMNPV 35K GENE 3*-UTR AND hrS 
FOR Esp 31-BASED MODULAR EXPRESSION VECTORS 



PRIMER SET: 



Strand 


Primer 


Sequence 


(*) stiand 


35KHR5A 


5* - ATTCCATGGATCCCGGGTACCT6TAACTAGTGCACTCAAC - 3' 


H strand 


35KHR5B 


5' - A6CAGCCTCGA6CCTCAG6CCTCCACATT6TCGACTT6CT -3' 



TEMPLATE: AcMNPV Hind III fragment "Q" ' '-'^ • ^' - 

AMPUFIED FRAGMENT: 

Wool Smal 35K gene end 

I I I I 35KTER i 

attccatggatcccgootaccTGTAACTAGTGCACTCAACA AAAATGTAATArrAAAC/tCAATTAAATAAATGTTGAAA 

taaggtacctagggcccatggACATTGATCACGTGAGTTGTTTTTACATTATAAnTGTGTTAAnTATTTACAACTTT 



BamHI Kpnl 

AnTATTGCCTAATATTATTTTrGTCAGnSSTTGTCAmATTAAnTGGATGATGTCCAmGT^ 
TAAATAACGGAnATAATAAAAACAGTCAASSAACAGTAAATAAnAAACCTACTACAGGTAAACAAAAAnTTAAC^^ 

EcoW 
I 

TGGCmACGAGTAGAAnCTACGCGTAAAACACAATCAAGTAYGAGTCATAAGCTGATGTCATGTTTTGCACACGGCTC 
ACCGAAATGCTCATCnAAGATGCGCATTTTGTGnAGnCATRCTCAGTATTCGACTACAGTACAAAACGTGTGCCGAG 

EcoRI 

I 

ATAACCGAACTGGCnTAGGAGTAGAAnCTACnGTAACGCACGATCGAGTGGATGATGGTCAmGTmTCAAATCG 
TAnGGCnGACCGAAATGCTCATCnAAGATGAACAnGCGTGCTAGCTCACCTACTACCAGTAAACAAAAAGTrTAaC 

EcoRt . 

AGATGATGTCATGTTTTGCACACGGGCTCATAAACTCGCmACGAGTAGAAnCTACGTGTAACGCACGA^^^ 
TCTACTACAGTACAAAACGTGTGCCCGAGTATnGAGCGAAATGCTCATCnAAGATGCACATTGCGTGCTAGCTAACTA 
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EcoRI 

GAGTCATTTGTnTGCAATATGATATCATACAATATGACTCAmGTTTTTCAAAACCGAACTTGATnACGGGTAGAAT 
CTCAGTAAACAAAACGnATACTATAGTATGnATACTGAGTAAACAAAAAGTTnGGCTTGAACTAAATGCCCATCnA 

EcoRI 

TCTACTYGTAAAGCACAATCAAAAAGATGATGTCATTTGTTTnCAAAACTGAACTCTCGGCnTACGAGTAGAATTCtA 
AGATGARCAmCGTGTTAGTTTTTCTACTACAGTAAACAAAAAGmTGACTTGAGAGCCGAAATGCTCATCTTAAGAT 



CGTGTAAAACACAATCAAGAAATGATGTCATTTGTTATAAAAATAAAAGCTGATGTCATGmTGCACATGGCTGATAAC 

GCACATTTTGTGnAGTTCTTTACTACAGTAAACAATATTnTATTTTCGAGTACAGTACAAAACGTGTACCGAGTAnG 

EcoRI 
I 

TAAACTCGCnTACGGGTAGAATTCTACGCGTAAAACATGAnGATAAnAAATAAnCATnGCAAAGCTATACCnAA 
AnTGAGCGAAATGCCCATCnAAGATGCGCATmGTACTAACTAnAATnAnAAGTAAACGTTTCGATATSCAATT 



ATCAAACGGACGTOTGGAATTGTATAATAnAAATATGGAATTGATCCAACAAATAAAATTRTAATAGAGCAAGTt^GAC 
TAGTTTGCCTGCAATACCnAACATAnATAATTTATACGnAACTAGGTTGTTTATTTTAAYATTATC TCGTTCAGCTG 

Slul xhol 

I 1 I 1 

AATGTGGAggectgaggctcgaooctgcf 

TTACACCTccggactccQagctccgacoa 
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PD oligo 31 

ATG^ 

Bam Hi 



Bam HI 



+1 
I 

ATT 




- +177 
I I 

CcggatccTTTC 



BamHI 



PVL 
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PCR AMPLIFICATION OF THE CODON-OPTIMIZED CUTICLE-AalT GENE SEQUENCE FROM 
PLASMID AC0055.1 ( transfer vector pVLSSS/Cutide:^^^^ 

PAUyiERSET: 



Strand 



I*) strand 



(*) strand 



Primer 



PD oligo 31 



PVLReverse 



Sequence 



5' - ATGTTCAAGTTCGT6ATGATCTGCGCC - 3' 



5' - G6ATTTCCTTGAAGAGAGT6A6 - 3' 



TEMPLATE: Plasmid pAG0055,1 (pVL985/Cuticle-AaIT-1) 
AMPLIREO PRODUCT: 

Met Phe Lys Phe VbI Met lie Cys Ala Val Leu Gly Leu Ala Val Ala Lys Lys Asn Gly 
tL H^^^^^ "'^^^^^ ATrTGCnrCGTC CTC GGC CTG 6CT GT6 6CC AAG AAG AAC GGC 
TAC AAG TTC AAG CAC TAG TAG ACG CGG CA6 GAG CCG G AC C6A CAC CG6 HC TTC HQ CC6 

Tyr Ala Val Asp Ser Ser Gly Lys Ala Pro Glu Cys Leu Leu Ser Asn Tyr Cys Asn Asn 
TAC GCA GTC 6AC TCA TCC G6A AAA 6CC CCC GAG TGC CTG CTC TCG AAC TAT TGC AAC AAT 
AT6 C6T CA6 CTG ACT AG6 CCT TJJ CGG GGG CTC ACG 6AC GAG AGC HG ATA ACG TTG TTA 



Glu Cys Thr 
GAA TGC ACC 
Cn ACG TGG 

Phe Gly Leu 
nc GGTCTC 
AAG CCA GAG 



Lys Val His Tyr 
AAG GTG CAC TAC 
nc CAC GTG ATG 

Asn Asp Asp Lys 
AAC GAC GACAAG 
HG CTG CTG TTC 



Asp Thr Thr 
GAC ACC ACC 
CTG TGG TGG 



He He Asn TER 
ATC AH AAC TAA 
TAG TAA TTG ATT 



Ala Asp Lys Gly Tyr Cys Cys Leu Leu Ser Cys Tyr Cys 
GCT GAC AAG GGC TAC T6T TGC CTT CTG TCC TGC TAT TGC 
CGA CTG nc CCG ATG ACA ACG GAA GAC AG6 ACG ATA ACG 

Lys Val Leu Glu lie Ser, Asp Thr Arg Lys Ser Tyr Cys 
AAA GH CTG GAA ATC TCT GAT ACT C6C AAG AGC TAC T6T 
TH CAA GAC CH TAG AGA CTA TGA GCG HC TCG ATG ACA. 

BamHI 



GGATCCTTTCCTGGGACCCGGCAAGAACCAAAAACTCACTCTCTTCAAGGAAATCC^ ^ ^ 
CCTAGGAAAGGACCCTGGGCCGTTCTTGGTTnrrGAGTGAGAGAAGTTCCTTTAGG 
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AC0076.1 



Sse8387t 



BamHI 





Kpnl 



StullKpni 



BSU361 




Sstl[ 



DA26 promoter 



6.9K 3 -U7R 



cuticle AalT 



SstI S$e36I StuI 

GAGCTCCTGCAGGCCT ACGCGTAATTCGATATAfiACATBACATCACTnRTfiAATTftTATTniiTTAAAAArikA^^fi 

CTGCCAATGTACCGTAnCAAATOCTACATGTATAAATCTGTGTTnCTAnGTAATGAATCACnAACACACT 
mAAnACGTCAATAAATGnAnCACCATTAmACCTGGTTTTTnGAGAGGGG^ 

nCCAGCCmATAAACGCTCACCAACCAAAGCAGGTCAnAnGTGCCAGGACGnCAAAGGCGAAACATCGAA 
ATG nC AAG TTC GTG ATG ATC TGC GCC GTC CTC GGC CTG GCT GTG 6CC AAG 
AAG AAC GGC TAG GCA GTC GAC TCA TCC GGA AAA GCC CCC GAG TGC CTG CTC 
TCG AAC TAT TGC AAC AAT GAA TGC ACC AAG GTG CAC TAC GCT GAC AAG GGC 
TAC TGT TGC CH CTG TCC TGC TAT TGC TTC GGT CTC AAC GAC GAC AAG AAA 
GTT CTG GAA ATC TCT GAT ACT CGC AAG AGC TAC TGT GAC ACC ACC ATC AH 

BamHI Kpnl 

AAC TAA G GATCCCGGGTACCAACCAGACAnCCACACAGCCGACAGTAGCGAATGAACGAAGCGATTTCGT 
CGCCTGCCCTCGrrTGGCTTTCGACTGTTACAAAATCATGTCTGCAAGATTTTAAACTAAGCCCGCTAAGCTCAA 



ATAGrrTATTnTATTACTGTTnGTAAATAAATAACTTTATCATTCAATAmGCCTGTGTTTGAGGCCTGAfeG 



Stul Bsu36l 



Xhol Kpnl 
CTCGAGGGGGGGCCCGGTACC 
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